WO2009023865A1 - Consumer-generated media influence and sentiment determination - Google Patents

Consumer-generated media influence and sentiment determination

Info

Publication number
WO2009023865A1
Authority
WO
WIPO (PCT)
Prior art keywords
topic
data set
determining
author
network
Prior art date
Application number
PCT/US2008/073401
Other languages
French (fr)
Inventor
Miles Ward
James Webber
Dean Graziano
Original Assignee
Visible Technologies, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Visible Technologies, Inc.
Publication of WO2009023865A1
Priority to PCT/US2009/061038 (WO2010065199A1)
Priority to US12/580,667 (US9269068B2)
Priority to US14/881,037 (US20160217488A1)

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00 Commerce
    • G06Q30/02 Marketing; Price estimation or determination; Fundraising

Definitions

  • CGM Consumer Generated Media
  • CGM may be a phrase that describes a wide variety of Internet web pages or sites, which are sometimes individually labeled as web logs or "blogs", mobile phone blogs or "mo-blogs", video hosting blogs or "vlogs" or "vblogs", forums, electronic discussion messages, Usenet, message boards, BBS emulating services, product review and discussion web sites, online retail sites that support customer comments, social networks, media repositories, audio and video sharing sites/networks and digital libraries.
  • Private non-Internet information systems can host CGM content as well, via environments like Sharepoint, Wiki, Jira, CRM systems, ERP systems, and advertising systems.
  • Other acronyms that describe this space are CCC (consumer created content), WSM (weblogs and social media), WOMM (Word of Mouth Media) or OWOM (online word of mouth), and many others.
  • Keyphrase may refer to a word, string of words, or groups of words with Boolean modifiers that are used as models for discovering CGM content that might be relevant to a given topic.
  • Post may refer to a single piece of CGM content. This might be a literal weblog posting, a comment, a forum reply, a product review, or any other single element of CGM content.
  • Site may refer to an Internet site which contains CGM content.
  • Blog may refer to an Internet site which contains CGM content.
  • Content may refer to media that resides on CGM sites.
  • CGM is often text, but includes audio files and streams (podcasts, mp3, streamcasts, Internet radio, etc.), video files and streams, animations (Flash, Java), and other forms of multimedia.
  • UI may refer to a User Interface, through which users interact with computer software, perform work, and review results.
  • IM may refer to an Instant Messenger, which is a class of software applications that allow direct text based communication between known peers.
  • Thread may refer to an "original" post and all of the comments connected to it, present on a blog or forum. A discussion thread holds the content display order: which message came first, which followed it, and so on.
  • Permalink may refer to a URL which persistently points to an individual CGM thread.
  • The Internet and other computer networks are communication systems. The sophistication of this communication has improved, and the primary modes have differentiated, over time and with technological progress. Each primary mode of online communication varies based on a combination of three basic values: privacy, persistence, and control.
  • Email as a communications medium is private (communications are initially exchanged only between named recipients) and persistent (saved in inboxes or mail servers), but lacks control (once you send the message, you can't take it back, edit it, or limit re-use of it).
  • Instant messaging is private, typically not persistent (some newer clients now allow users to save history, so this mode is changing), and lacks control.
  • Message boards are public (typically all members, and often all Internet users, can access your message) and persistent, but lack control (they are typically moderated by a central owner of the board).
  • Chat rooms are public (again, some are membership based), typically not persistent, and lack control.
  • Blogs and Social Networks are the predominant communications mediums that permit author control. By reducing the cost, technical sophistication, and experience required to create and administer a web site, blogs and other persistent online communication have given an unprecedented amount of editorial control to millions of online authors. This has created a unique new environment for creative expression, commentary, discourse, and criticism without the historical limits of editorial control, cost, technical expertise, or distribution/exposure.
  • This new medium represents a significant challenge for interested parties to comprehensively understand and interact with.
  • Q1 2007 estimates for the number of active, unique online CGM sites range from 50 to 71 million, with growth rates in the hundreds of thousands of new sites per day.
  • Compared to the typical mediums that PR, Advertising and Marketing businesses and divisions interact with (<1000 TV channels, <1000 radio stations, <1000 major news publications, <10-20 major pundits on any given subject, etc.), this represents a nearly 10,000-fold increase in the number of potential targets for interaction.
  • Businesses and other motivated communicators have come to depend on software that performs Business Intelligence, Customer Relationship Management, and Enterprise Resource Planning tasks to facilitate accelerated, organized, prioritized, tracked and analyzed interaction with customers and other target groups (voters, consumers, pundits, opinion leaders, analysts, reporters, etc.). These systems have been extended to facilitate IM, E-mail, and telephone interactions. These media have been successfully integrated because of standards (jabber, pop3, smtp, pots, imap) that require that all participant applications conform to a set data format that allows interaction with this data in a predictable way.
  • Blogs and other CGM generate business value for their owners, both on private sites that use custom or open source software to manage their communications, and for massive public hosts. Because these sites can generate advertising revenue, there is a drive by author/owners to protect the content on these sites, so readers/subscribers/peers have to visit the site, and become exposed to revenue generating advertising, in order to participate in/observe the communication. Because of this financial disincentive, there is no unifying standard for blogs which contains complete data. RSS and Atom feeds allow structured communication of some portion of the communication on sites, but are often very incomplete representations of the data available on a given site.
  • Sites also protect their content from being "stolen" by automated systems with an array of CAPTCHAs ("Completely Automated Public Turing test to tell Computers and Humans Apart"), email verification, mobile phone text message verification, password authentication, cookie tracking, Uniform Resource Locator (URL) obfuscation, timeouts, and Internet Protocol (IP) address tracking.
  • FIGURES 1-2 show an exemplary system for consumer generated media reputation management according to an embodiment.
  • FIGURE 3 shows a system for consumer generated media influence and sentiment determination according to an embodiment of the invention.
  • FIGURE 4 illustrates an authority map according to an embodiment of the invention.
  • FIGURE 5 illustrates a feature of an authority map according to an embodiment of the invention.
  • FIGURES 6-9 illustrate authority map features according to embodiments of the invention.
  • FIG. 1 illustrates an example of a suitable computing system environment 100 on which an embodiment of the invention may be implemented.
  • the computing system environment 100 is only one example of a suitable computing environment and is not intended to suggest any limitation as to the scope of use or functionality of embodiments of the invention. Neither should the computing environment 100 be interpreted as having any dependency or requirement relating to any one or combination of components illustrated in the exemplary operating environment 100.
  • Embodiments of the invention are operational with numerous other general-purpose or special-purpose computing-system environments or configurations.
  • Examples of well-known computing systems, environments, and/or configurations that may be suitable for use with embodiments of the invention include, but are not limited to, personal computers, server computers, hand-held or laptop devices, multiprocessor systems, microprocessor-based systems, set-top boxes, programmable consumer electronics, network PCs, minicomputers, mainframe computers, distributed-computing environments that include any of the above systems or devices, and the like.
  • Embodiments of the invention may be described in the general context of computer-executable instructions, such as program modules, being executed by a computer.
  • program modules include routines, programs, objects, components, data structures, etc. that perform particular tasks or implement particular abstract data types.
  • Embodiments of the invention may also be practiced in distributed-computing environments where tasks are performed by remote processing devices that are linked through a communications network.
  • program modules may be located in both local- and remote-computer storage media including memory storage devices.
  • an exemplary system for implementing an embodiment of the invention includes a computing device, such as computing device 100.
  • In its most basic configuration, computing device 100 typically includes at least one processing unit 102 and memory 104.
  • memory 104 may be volatile (such as random-access memory (RAM)), non-volatile (such as read-only memory (ROM), flash memory, etc.) or some combination of the two. This most basic configuration is illustrated in FIG. 1 by dashed line 106.
  • device 100 may have additional features/functionality.
  • device 100 may also include additional storage (removable and/or nonremovable) including, but not limited to, magnetic or optical disks or tape.
  • additional storage is illustrated in FIG. 1 by removable storage 108 and non-removable storage 110.
  • Computer storage media includes volatile and nonvolatile, removable and non-removable media implemented in any method or technology for storage of information such as computer-readable instructions, data structures, program modules or other data.
  • Memory 104, removable storage 108 and non-removable storage 110 are all examples of computer storage media.
  • Computer storage media includes, but is not limited to, RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital versatile disks (DVD) or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to store the desired information and which can be accessed by device 100. Any such computer storage media may be part of device 100.
  • Device 100 may also contain communications connection(s) 112 that allow the device to communicate with other devices.
  • Communications connection(s) 112 is an example of communication media.
  • Communication media typically embodies computer-readable instructions, data structures, program modules or other data in a modulated data signal such as a carrier wave or other transport mechanism and includes any information delivery media.
  • modulated data signal means a signal that has one or more of its characteristics set or changed in such a manner as to encode information in the signal.
  • communication media includes wired media such as a wired network or direct-wired connection, and wireless media such as acoustic, radio-frequency (RF), infrared and other wireless media.
  • computer-readable media as used herein includes both storage media and communication media.
  • Device 100 may also have input device(s) 114 such as keyboard, mouse, pen, voice-input device, touch-input device, etc.
  • Output device(s) 116 such as a display, speakers, printer, etc. may also be included. All such devices are well-known in the art and need not be discussed at length here.
  • System 200 includes an electronic client device 210, such as a personal computer or workstation, that is linked via a communication medium, such as a network 220 (e.g., the Internet), to an electronic device or system, such as a server 230.
  • the server 230 may further be coupled, or otherwise have access, to a database 240 and a computer system 260.
  • Although FIG. 2 includes one server 230 coupled to one client device 210 via the network 220, it should be recognized that embodiments of the invention may be implemented using one or more such client devices coupled to one or more such servers.
  • each of the client device 210 and server 230 may include all or fewer than all of the features associated with the device 100 illustrated in and discussed with reference to FIG. 1.
  • Client device 210 includes or is otherwise coupled to a computer screen or display 250.
  • client device 210 can be used for various purposes including both network- and local-computing processes.
  • the client device 210 is linked via the network 220 to server 230 so that computer programs, such as, for example, a browser, running on the client device 210 can cooperate in two-way communication with server 230.
  • Server 230 may be coupled to database 240 to retrieve information therefrom and to store information thereto.
  • Database 240 may include a plurality of different tables (not shown) that can be used by server 230 to enable performance of various aspects of embodiments of the invention.
  • the server 230 may be coupled to the computer system 260 in a manner allowing the server to delegate certain processing functions to the computer system.
  • methods and systems are implemented by a coordinated software and hardware computer system.
  • This system may include a set of dedicated networked servers controlled by an embodiment.
  • the servers may be installed with a combination of commercially available software, custom configurations, and custom software.
  • a web server is one of those modules, which exposes a web based client-side UI to customer web browsers.
  • the UI interacts with the dedicated servers to deliver information to users.
  • the cumulative logical function of these systems results in a system and method of an embodiment.
  • the servers could be placed client side, could be shared or publicly owned, could be located together or separately.
  • the servers could be the aggregation of non-dedicated compute resources from a Peer to Peer (P2P), grid, or other distributed network computing environments.
  • the servers could run different commercial applications, different configurations with the same or similar cumulative logical function.
  • the client to this system could be run directly from the server, could be a client side executable, could reside on a mobile phone or mobile media device, could be a plug-in to other Line of Business applications or management systems.
  • This system could operate in a client-less mode where only Application Programming Interface (API) or Extensible Markup Language (XML) or Web-Services or other formatted network connections are made directly to the server system.
  • FIGURE 3 shows a system within which may be implemented a method for consumer-generated media influence and sentiment determination.
  • the system can be broken down into a set of modules.
  • the modules may be, but are not limited to, the following: a collection module 275 that receives data from Internet CGM sites 270, an ingestion module 280, an analysis module 285, a reporting module 290, and a response module 295, which may provide feedback data back to sites 270, as are described in greater detail below herein.
  • Embodiments of the invention may be described in the context of one or more ecosystems.
  • An "ecosystem" in the context of the present application may describe online personas and locations (sites) of their interactions that can be further described by how the interactions occur, the topics of those interactions, the frequency of interactions, etc.
  • the authority map is a way to visualize the large and interconnected network of the web by helping reduce the size and scope of such an ecosystem to a consumable format.
  • an authority map 400 is illustrated, which may be displayed within a graphical user interface 401 on the display device 250.
  • the authority map 400 is a tool for identifying and understanding the authors, associated with a specified topic of interest, that matter to a particular entity using such an embodiment.
  • the displayed map 400 shows an icon 405 representing a topic being analyzed, which, as illustrated, may be displayed as a hub of a hub-and-spoke configuration, along with a textual description of the topic.
  • icons 410 representing authors of varying levels of authority or perceived influence (discussed in greater detail below herein) who have commented or otherwise posted an opinion on the displayed topic.
  • These icons 410 may further include a domain identifier associated with the author, as illustrated.
  • icons 415 representing sites of varying levels of authority or perceived influence (discussed in greater detail below herein) hosting conversations involving those authors and the displayed topic. These icons 415 may further include a domain identifier associated with the site, as illustrated.
  • each of the icons 410, 415 may be presented in a distinguishing format to indicate varying levels of authority/influence, and/or prevailing opinion or sentiment on the topic, associated with authors and sites.
  • size of the icons 410, 415 may correspond to authority/influence of the respective author or site: bigger for more authoritative, smaller for less authoritative.
  • Color, shading or pattern type of the icons 410, 415 may correspond to prevailing sentiment (e.g., green for positive, red for negative, grey for neutral, and orange for mixed).
  • Lines 420 connect the icons 410 of authors to the icons 415 of sites that host them, and from the site icons to the topic icon 405 at the center.
  • Dotted (or other distinguishing) lines 425 represent conversations or other connections occurring between authors.
  • arrows at the ends of the dotted lines 425 show the direction of interaction, pointing, for example, from commenter to original post author.
  • a criteria panel (not shown), such as a pull-down menu, for example, may be used to select the topic of interest.
  • the interface 401 allows a user to get additional information about any of the nodes (icons associated with authors, sites, and topics) on the display 401. For example, and referring to FIGURE 5, by left clicking on a node, a small pop-up window 500 with additional detail about that node will appear.
  • the display allows one to promote or "pin" nodes that are of interest, which makes those items larger on the screen. Items may be pinned by clicking on the upper right hand side of the node icon.
  • the magnitude of author authority may be calculated based on data representing the topic selected by the user, using the conversations between authors and the activity generated by the commentary of a particular author (e.g., the number of comments posted in response to a comment by the author) to evaluate the author's authority.
  • This data may be calculated or otherwise determined computationally/automatically (i.e., by execution of computer-executable instructions), by human analysis, or some combination of both types of approaches.
  • the magnitude of site authority may be defined or otherwise determined in a manner similar to that used to determine the magnitude of author authority.
  • Data representing content pertaining to a particular topic may be determined to have been written or otherwise produced by someone at a site.
  • sites having associated therewith a predetermined threshold number of comments pertaining to a particular topic may be determined to be an authoritative site.
  • the magnitude of the authority of these sites may then be determined based on, for example, the amount or volume of comment pertaining to the topic in question and associated with each respective site.
  • This data may be calculated or otherwise determined computationally/automatically (i.e., by execution of computer-executable instructions), by human analysis, or some combination of both types of approaches.
  • Sentiment may be calculated by a weighted metric on the overall sentiment distribution, which favors "sentimented" values over neutral values four to one. This ensures that a user is seeing which way an author leans when writing on a topic. Counts and totals are reflective of the on-topic conversations based on the topic of interest chosen: if an author has written 200 posts, but only 5 are about the topic you're researching, the calculations will only leverage the 5. The result is that the user can set the context in order to identify authorities in relation to that context.
  • an embodiment of the authority map is a series of calculations. As raw data comes in from collection, the data is processed and analyzed in several ways. Each unique post or comment is first matched to one or more topics of interest leveraging term-based definitions. For each topic matched, a sentiment is assigned using either manual attribution or computational attribution. Computational attribution of sentiment is achieved using technology that correlates patterns between a set of known pieces of content that represent the sentiment for a topic to the individual piece of content being analyzed. For example, an embodiment uses text parsing in conjunction with Bayesian inference in order to assign a probability that a post exists within each of a neutral or sentimented "states." Each state is represented by a definition derived from groups of posts that are characteristic of that state. The comparison is done using the state definitions that are stored in an index resident on the client device 210 and/or server 230 and/or database 240 and comparing that state definition with the content in question. Alternatively, or additionally, an embodiment uses keyword/keyphrase/
  • the dominant sentiment is calculated by a weighted metric on the overall sentiment distribution across all posts that match the topic being analyzed, weighting "sentimented" values over neutral values in a 4:1 ratio.
  • For authors, the posts not only match the topic, but have also been written by the author of interest.
  • For sites, the posts not only match the topic, but have also been written at the site of interest.
  • Authority is then calculated based on the data representing the topic selected by the user, using the conversations between authors and the activity (post counts) to evaluate the author's (or site's) Authority.
  • embodiments of an authority map include but are not limited to the following features:
  • Alternative embodiments may include:
  • Inbound authors are those that comment on a given author's original post.
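
The computational sentiment attribution mentioned above (text parsing combined with Bayesian inference over neutral and "sentimented" states) might be sketched as follows. This is only an illustration, not the patented implementation: the function names, the bag-of-words tokenizer, the add-one smoothing, and the uniform prior over states are all assumptions, since the text says only that each state is defined from groups of characteristic posts and that a probability is assigned per state.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens (assumed parser)."""
    return re.findall(r"[a-z']+", text.lower())


def build_state_definitions(examples):
    """Build one word-count "state definition" per state.

    `examples` maps a state name (e.g. "neutral", "positive") to a list of
    posts that are characteristic of that state.
    """
    return {
        state: Counter(tok for post in posts for tok in tokenize(post))
        for state, posts in examples.items()
    }


def state_probabilities(post, definitions):
    """Return P(state | post) using naive Bayes.

    Assumptions: add-one smoothing, a uniform prior over states, and
    words never seen in any state definition are ignored.
    """
    vocab = {tok for counts in definitions.values() for tok in counts}
    log_scores = {}
    for state, counts in definitions.items():
        total = sum(counts.values()) + len(vocab)
        log_scores[state] = sum(
            math.log((counts[tok] + 1) / total)
            for tok in tokenize(post)
            if tok in vocab
        )
    # Normalize in log space to avoid numeric underflow on long posts.
    peak = max(log_scores.values())
    weights = {s: math.exp(v - peak) for s, v in log_scores.items()}
    norm = sum(weights.values())
    return {s: w / norm for s, w in weights.items()}
```

In use, the state definitions would be built once from hand-labeled posts and stored in an index; each incoming post is then scored against every state, and the highest-probability state can be taken as its sentiment.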

Abstract

A method implementable in at least one electronic device coupled to a network and a display device, includes receiving, over the network, a data set, receiving, from a user, a selection of a first topic, determining, based on the data set, a plurality of network sites hosting commentary of the first topic and an authority level of each site of the plurality, determining, based on the data set, an authority level of each site of the plurality, determining, based on the data set, a plurality of authors providing the commentary hosted by the plurality of network sites, determining, based on the data set, an authority level of each author of the plurality, and determining, based on the data set, a value characterizing an opinion of each author on the first topic.
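
The steps in the abstract (select a topic, find the sites and authors commenting on it, and derive an authority level and an opinion value for each) can be sketched as below. This is a minimal sketch under stated assumptions, not the claimed method: the `Post` record, the `analyze` function, and the `site_threshold` parameter are hypothetical names; site authority is taken as on-topic post volume, and author authority as on-topic post count plus inbound comments from other authors. Only the 4:1 weighting of "sentimented" over neutral values comes directly from the text.

```python
from collections import Counter, defaultdict
from dataclasses import dataclass
from typing import Optional

SENTIMENTED_WEIGHT = 4  # "sentimented" values outweigh neutral 4:1 (from the text)
NEUTRAL_WEIGHT = 1


@dataclass(frozen=True)
class Post:
    """Hypothetical record for one piece of CGM content."""
    post_id: str
    author: str
    site: str
    topics: frozenset
    sentiment: str  # "positive", "negative", "mixed" or "neutral"
    reply_to: Optional[str] = None  # post_id of the post being commented on


def dominant_sentiment(posts):
    """Pick the sentiment with the largest weighted count."""
    weighted = Counter()
    for p in posts:
        if p.sentiment == "neutral":
            weighted[p.sentiment] += NEUTRAL_WEIGHT
        else:
            weighted[p.sentiment] += SENTIMENTED_WEIGHT
    return weighted.most_common(1)[0][0] if weighted else "neutral"


def analyze(posts, topic, site_threshold=1):
    """Return (sites, authors) dictionaries for the selected topic."""
    # Only on-topic posts count toward authority and sentiment.
    on_topic = [p for p in posts if topic in p.topics]
    by_id = {p.post_id: p for p in on_topic}
    by_site, by_author = defaultdict(list), defaultdict(list)
    inbound = Counter()  # comments received from other authors
    for p in on_topic:
        by_site[p.site].append(p)
        by_author[p.author].append(p)
        parent = by_id.get(p.reply_to)
        if parent is not None and parent.author != p.author:
            inbound[parent.author] += 1
    sites = {
        s: {"authority": len(ps), "sentiment": dominant_sentiment(ps)}
        for s, ps in by_site.items()
        if len(ps) >= site_threshold  # assumed threshold for an authoritative site
    }
    authors = {
        a: {"authority": len(ps) + inbound[a], "sentiment": dominant_sentiment(ps)}
        for a, ps in by_author.items()
    }
    return sites, authors
```

The resulting authority values could drive icon size and the sentiment labels icon color in a display such as the authority map, though the scaling between authority and icon size is not specified in the text.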

Description

CONSUMER-GENERATED MEDIA INFLUENCE
AND SENTIMENT DETERMINATION
PRIORITY CLAIM
[0001] This application claims the benefit of U.S. Provisional Application Serial No. 60/965,067 and U.S. Provisional Application Serial No. 60/956,097 filed August 15, 2007. Each of the foregoing applications is hereby incorporated by reference in their entirety as if fully set forth herein.
COPYRIGHT NOTICE
[0002] This disclosure is protected under United States and International Copyright Laws. © 2006-2008 Visible Technologies. All Rights Reserved. A portion of the disclosure of this patent document contains material which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure after formal publication by the USPTO, as it appears in the Patent and Trademark Office patent file or records, but otherwise reserves all copyright rights whatsoever.
BACKGROUND OF THE INVENTION
[0003] As used herein, the term "Consumer Generated Media" (hereinafter CGM) may be a phrase that describes a wide variety of Internet web pages or sites, which are sometimes individually labeled as web logs or "blogs", mobile phone blogs or "mo-blogs", video hosting blogs or "vlogs" or "vblogs", forums, electronic discussion messages, Usenet, message boards, BBS emulating services, product review and discussion web sites, online retail sites that support customer comments, social networks, media repositories, audio and video sharing sites/networks and digital libraries. Private non-Internet information systems can host CGM content as well, via environments like Sharepoint, Wiki, Jira, CRM systems, ERP systems, and advertising systems. Other acronyms that describe this space are CCC (consumer created content), WSM (weblogs and social media), WOMM (Word of Mouth Media) or OWOM (online word of mouth), and many others.
[0004] As used herein, the term "Keyphrase" may refer to a word, string of words, or groups of words with Boolean modifiers that are used as models for discovering CGM content that might be relevant to a given topic. A Keyphrase could also be an example image, audio file or video file that has characteristics that would be used for content discovery and matching.
[0005] As used herein, the term "Post" may refer to a single piece of CGM content. This might be a literal weblog posting, a comment, a forum reply, a product review, or any other single element of CGM content.
[0006] As used herein, the term "Site" may refer to an Internet site which contains CGM content.
[0007] As used herein, the term "Blog" may refer to an Internet site which contains CGM content.
[0008] As used herein, the term "Content" may refer to media that resides on CGM sites. CGM is often text, but includes audio files and streams (podcasts, mp3, streamcasts, Internet radio, etc.), video files and streams, animations (Flash, Java), and other forms of multimedia.
[0009] As used herein, the term "UI" may refer to a User Interface, through which users interact with computer software, perform work, and review results.
[0010] As used herein, the term "IM" may refer to an Instant Messenger, which is a class of software applications that allow direct text based communication between known peers.
[0011] As used herein, the term "Thread" may refer to an "original" post and all of the comments connected to it, present on a blog or forum. A discussion thread holds the content display order: which message came first, which followed it, and so on.
[0012] As used herein, the term "Permalink" may refer to a URL which persistently points to an individual CGM thread.
[0013] The Internet and other computer networks are communication systems. The sophistication of this communication has improved, and the primary modes have differentiated, over time and with technological progress. Each primary mode of online communication varies based on a combination of three basic values: privacy, persistence, and control. Email as a communications medium is private (communications are initially exchanged only between named recipients) and persistent (saved in inboxes or mail servers), but lacks control (once you send the message, you can't take it back, edit it, or limit re-use of it). Instant messaging is private, typically not persistent (some newer clients now allow users to save history, so this mode is changing), and lacks control. Message boards are public (typically all members, and often all Internet users, can access your message) and persistent, but lack control (they are typically moderated by a central owner of the board). Chat rooms are public (again, some are membership based), typically not persistent, and lack control.
Figure imgf000005_0001
[0014] Blogs and Social Networks are the predominant communications mediums that permit author control. By reducing the cost, technical sophistication, and experience required to create and administer a web site, blogs and other persistent online communication have given an unprecedented amount of editorial control to millions of online authors. This has created a unique new environment for creative expression, commentary, discourse, and criticism without the historical limits of editorial control, cost, technical expertise, or distribution/exposure.
[0015] There is significant value in the information contained within this public media. Because the opinions, topics of discussion, brands and celebrities mentioned and relationships evinced are typically totally unsolicited, the information presented, if well studied, represents an amazing new source of social insight, consumer feedback, opinion measurement, popularity analysis and messaging data. It also represents a fully exposed, granular network of peer and hierarchical relationships rich with authority and influence. The marketing, advertising, and PR value of this information is unprecedented.
[0016] This new medium represents a significant challenge for interested parties to comprehensively understand and interact with. As of Ql 2007 estimates for the number of active, unique online CGM sites (forums, blogs, social networks, etc.) range from 50 to 71 million, with growth rates in the hundreds of thousands of new sites per day. Compared to the typical mediums that PR, Advertising and Marketing businesses and divisions interact with (<1000 TV channels, <1000 radio stations, <1000 major news publications, < 10-20 major pundits on any given subject, etc.) this represents a nearly 10,000-fold increase in the number of potential targets for interaction.
[0017] Businesses and other motivated communicators have come to depend on software that perform Business Intelligence, Customer Relationship Management, and Enterprise Resource Planning tasks to facilitate accelerated, organized, prioritized, tracked and analyzed interaction with customers and other target groups (voters, consumers, pundits, opinion leaders, analysts, reporters, etc.). These systems have been extended to facilitate IM, E-mail, and telephone interactions. These media have been successfully integrated because of standards (jabber, pop3, smtp, pots, imap) that require that all participant applications conform to a set data format that allows interaction with this data in a predictable way.
[0018] Blogs and other CGM generate business value for their owners, both on private sites that use custom or open source software to manage their communications, and for massive public hosts. Because these sites can generate advertising revenue, there is a drive by author/owners to protect the content on these sites, so readers/subscribers/peers have to visit the site, and become exposed to revenue generating advertising, in order to participate in/observe the communication. Because of this financial disincentive, there is no unifying standard for blogs which contains complete data. RSS and Atom feeds allow structured communication of some portion of the communication on sites, but are often very incomplete representations of the data available on a given site. Sites also protect their content from being "stolen" by automated systems with an array of CAPTCHAs ("Completely Automated Public Turing test to tell Computers and Humans Apart"), email verification, mobile phone text message verification, password authentication, cookie tracking, Uniform Resource Locator (URL) obfuscation, timeouts and Internet Protocol (IP) address tracking.
[0019] The result is a massively diverse community that it would be very valuable to understand and interact with, which resists aggregation and unified interaction by way of significant technical diversity, resistance to complete information data standards, and tests that attempt to require one-to-one human interaction with content.
BRIEF DESCRIPTION OF THE DRAWINGS
[0020] The preferred and alternative embodiments of the present invention are described in detail below with reference to the following drawings.
[0021] FIGURES 1-2 show an exemplary system for consumer generated media reputation management according to an embodiment;
[0022] FIGURE 3 shows a system for consumer generated media influence and sentiment determination according to an embodiment of the invention;
[0023] FIGURE 4 illustrates an authority map according to an embodiment of the invention;
[0024] FIGURE 5 illustrates a feature of an authority map according to an embodiment of the invention; and
[0025] FIGURES 6-9 illustrate authority map features according to embodiments of the invention.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT
[0026] FIG. 1 illustrates an example of a suitable computing system environment 100 on which an embodiment of the invention may be implemented. The computing system environment 100 is only one example of a suitable computing environment and is not intended to suggest any limitation as to the scope of use or functionality of embodiments of the invention. Neither should the computing environment 100 be interpreted as having any dependency or requirement relating to any one or combination of components illustrated in the exemplary operating environment 100.
[0027] Embodiments of the invention are operational with numerous other general-purpose or special-purpose computing-system environments or configurations. Examples of well-known computing systems, environments, and/or configurations that may be suitable for use with embodiments of the invention include, but are not limited to, personal computers, server computers, hand-held or laptop devices, multiprocessor systems, microprocessor-based systems, set-top boxes, programmable consumer electronics, network PCs, minicomputers, mainframe computers, distributed-computing environments that include any of the above systems or devices, and the like.
[0028] Embodiments of the invention may be described in the general context of computer-executable instructions, such as program modules, being executed by a computer. Generally, program modules include routines, programs, objects, components, data structures, etc. that perform particular tasks or implement particular abstract data types. Embodiments of the invention may also be practiced in distributed-computing environments where tasks are performed by remote processing devices that are linked through a communications network. In a distributed-computing environment, program modules may be located in both local- and remote-computer storage media including memory storage devices.
[0029] With reference to FIG. 1, an exemplary system for implementing an embodiment of the invention includes a computing device, such as computing device 100. In its most basic configuration, computing device 100 typically includes at least one processing unit 102 and memory 104.
[0030] Depending on the exact configuration and type of computing device, memory 104 may be volatile (such as random-access memory (RAM)), non-volatile (such as read-only memory (ROM), flash memory, etc.) or some combination of the two. This most basic configuration is illustrated in FIG. 1 by dashed line 106.
[0031] Additionally, device 100 may have additional features/functionality. For example, device 100 may also include additional storage (removable and/or non-removable) including, but not limited to, magnetic or optical disks or tape. Such additional storage is illustrated in FIG. 1 by removable storage 108 and non-removable storage 110. Computer storage media includes volatile and nonvolatile, removable and non-removable media implemented in any method or technology for storage of information such as computer-readable instructions, data structures, program modules or other data. Memory 104, removable storage 108 and non-removable storage 110 are all examples of computer storage media. Computer storage media includes, but is not limited to, RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital versatile disks (DVD) or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to store the desired information and which can be accessed by device 100. Any such computer storage media may be part of device 100.
[0032] Device 100 may also contain communications connection(s) 112 that allow the device to communicate with other devices. Communications connection(s) 112 is an example of communication media. Communication media typically embodies computer-readable instructions, data structures, program modules or other data in a modulated data signal such as a carrier wave or other transport mechanism and includes any information delivery media. The term "modulated data signal" means a signal that has one or more of its characteristics set or changed in such a manner as to encode information in the signal. By way of example, and not limitation, communication media includes wired media such as a wired network or direct-wired connection, and wireless media such as acoustic, radio-frequency (RF), infrared and other wireless media. The term computer-readable media as used herein includes both storage media and communication media.
[0033] Device 100 may also have input device(s) 114 such as keyboard, mouse, pen, voice-input device, touch-input device, etc. Output device(s) 116 such as a display, speakers, printer, etc. may also be included. All such devices are well-known in the art and need not be discussed at length here.
[0034] Referring now to FIG. 2, an embodiment of the present invention can be described in the context of an exemplary computer network system 200 as illustrated. System 200 includes an electronic client device 210, such as a personal computer or workstation, that is linked via a communication medium, such as a network 220 (e.g., the Internet), to an electronic device or system, such as a server 230. The server 230 may further be coupled, or otherwise have access, to a database 240 and a computer system 260. Although the embodiment illustrated in FIG. 2 includes one server 230 coupled to one client device 210 via the network 220, it should be recognized that embodiments of the invention may be implemented using one or more such client devices coupled to one or more such servers.
[0035] In an embodiment, each of the client device 210 and server 230 may include all or fewer than all of the features associated with the device 100 illustrated in and discussed with reference to FIG. 1. Client device 210 includes or is otherwise coupled to a computer screen or display 250. As is well known in the art, client device 210 can be used for various purposes including both network- and local-computing processes.
[0036] The client device 210 is linked via the network 220 to server 230 so that computer programs, such as, for example, a browser, running on the client device 210 can cooperate in two-way communication with server 230. Server 230 may be coupled to database 240 to retrieve information therefrom and to store information thereto. Database 240 may include a plurality of different tables (not shown) that can be used by server 230 to enable performance of various aspects of embodiments of the invention. Additionally, the server 230 may be coupled to the computer system 260 in a manner allowing the server to delegate certain processing functions to the computer system.
[0037] In at least one embodiment, methods and systems are implemented by a coordinated software and hardware computer system. This system may include a set of dedicated networked servers controlled by an embodiment. The servers may be installed with a combination of commercially available software, custom configurations, and custom software. A web server is one of those modules, which exposes a web based client-side UI to customer web browsers. The UI interacts with the dedicated servers to deliver information to users. The cumulative logical function of these systems results in a system and method of an embodiment.
[0038] In alternate embodiments, the servers could be placed client side, could be shared or publicly owned, could be located together or separately. The servers could be the aggregation of non-dedicated compute resources from a Peer to Peer (P2P), grid, or other distributed network computing environments. The servers could run different commercial applications, different configurations with the same or similar cumulative logical function. The client to this system could be run directly from the server, could be a client side executable, could reside on a mobile phone or mobile media device, could be a plug-in to other Line of Business applications or management systems. This system could operate in a client-less mode where only Application Programming Interface (API) or Extensible Markup Language (XML) or Web-Services or other formatted network connections are made directly to the server system. These outside consumers could be installed on the same servers as the custom application components. The custom server-side engine applications could be written in different languages, using different constructs, foundations, architectural methodologies, storage and processing behaviors while retaining the same or similar cumulative logical function. The UI could be built in different languages, using different constructs, foundations, architectural methodologies, storage and processing behaviors while retaining the same or similar cumulative logical function.
[0039] FIGURE 3 shows a system within which may be implemented a method for consumer-generated media influence and sentiment determination. The system can be broken down into a set of modules. The modules may be, but are not limited to, the following: a collection module 275 that receives data from Internet CGM sites 270, an ingestion module 280, an analysis module 285, a reporting module 290, and a response module 295, which may provide feedback data back to sites 270, as are described in greater detail below herein.
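The module chain of FIGURE 3 can be pictured as a sequence of stages that each hand their output to the next. The following is a minimal Python sketch under stated assumptions, not the disclosed implementation: the function names, the dictionary record shape, the placeholder word-count analysis, and the sample input are all hypothetical, and the response module 295 is omitted because the text does not detail its feedback format.

```python
from typing import Callable, Dict, List

Record = Dict[str, str]  # hypothetical record shape, not taken from the source


def collect(raw_pages: List[str]) -> List[Record]:
    """Collection stage: wrap raw text received from CGM sites into records."""
    return [{"text": page} for page in raw_pages]


def ingest(records: List[Record]) -> List[Record]:
    """Ingestion stage: normalize whitespace and drop empty records."""
    cleaned = []
    for record in records:
        text = " ".join(record["text"].split())
        if text:
            cleaned.append({**record, "text": text})
    return cleaned


def analyze(records: List[Record]) -> List[Record]:
    """Analysis stage: placeholder that tags each record with its word count."""
    return [{**r, "words": str(len(r["text"].split()))} for r in records]


def report(records: List[Record]) -> List[str]:
    """Reporting stage: render one summary line per record."""
    return [f'{r["words"]} words: {r["text"]}' for r in records]


def run_pipeline(data, stages: List[Callable]):
    """Feed the output of each stage into the next one, in order."""
    for stage in stages:
        data = stage(data)
    return data


lines = run_pipeline(
    ["  Great   phone ", "", "Battery died fast"],
    [collect, ingest, analyze, report],
)
```

Keeping each stage a plain function makes it easy to swap one module for another without touching the rest of the chain, which is consistent with the "may be, but are not limited to" wording above.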
[0040] Embodiments of the invention may be described in the context of one or more ecosystems. An "ecosystem" in the context of the present application may describe online personas and locations (sites) of their interactions that can be further described by how the interactions occur, the topics of those interactions, the frequency of interactions, etc. The authority map is a way to visualize the large and interconnected network of the web by helping reduce the size and scope of such an ecosystem to a consumable format.
[0041] In an embodiment, and referring now to FIGURE 4, an authority map 400 is illustrated, which may be displayed within a graphical user interface 401 on the display device 250. The authority map 400 is a tool for identifying and understanding the authors, associated with a specified topic of interest, that matter to a particular entity using such an embodiment. In the illustrated embodiment, the displayed map 400 shows an icon 405 representing a topic being analyzed, which, as illustrated, may be displayed as a hub of a hub-and-spoke configuration, along with a textual description of the topic. Also displayed are icons 410 representing authors of varying levels of authority or perceived influence (discussed in greater detail below herein) who have commented or otherwise posted an opinion on the displayed topic. These icons 410 may further include a domain identifier associated with the author, as illustrated. Also displayed are icons 415 representing sites of varying levels of authority or perceived influence (discussed in greater detail below herein) hosting conversations involving those authors and the displayed topic. These icons 415 may further include a domain identifier associated with the site, as illustrated.
[0042] In an embodiment, each of the icons 410, 415 may be presented in a distinguishing format to indicate varying levels of authority/influence, and/or prevailing opinion or sentiment on the topic, associated with authors and sites. For example, size of the icons 410, 415 may correspond to authority/influence of the respective author or site: bigger for more authoritative, smaller for less authoritative. Color, shading or pattern type of the icons 410, 415 may correspond to prevailing sentiment (e.g., green for positive, red for negative, grey for neutral, and orange for mixed). Lines 420 connect the icons 410 of authors to the icons 415 of sites that host them, and from the site icons to the topic icon 405 at the center. Dotted (or other distinguishing) lines 425 represent conversations or other connections occurring between authors. In an embodiment, arrows at the ends of the dotted lines 425 show the direction of interaction, pointing, for example, from commenter to original post author.
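The icon formatting described in this paragraph amounts to a mapping from two values (authority and sentiment) to two visual properties (size and color). A minimal Python sketch follows. The color assignments come from the example in the text; the normalized authority range of 0 to 1, the pixel bounds, and the names `icon_style` and `SENTIMENT_COLORS` are assumptions made only for illustration.

```python
# Color assignments follow the example in the text; everything else is assumed.
SENTIMENT_COLORS = {
    "positive": "green",
    "negative": "red",
    "neutral": "grey",
    "mixed": "orange",
}


def icon_style(authority: float, sentiment: str,
               min_px: int = 16, max_px: int = 64) -> dict:
    """Map an authority score in [0, 1] and a sentiment label to an icon style.

    Bigger icons mean more authority; unknown sentiment labels fall back to grey.
    """
    clamped = max(0.0, min(1.0, authority))
    size = round(min_px + clamped * (max_px - min_px))
    return {"size_px": size, "color": SENTIMENT_COLORS.get(sentiment, "grey")}


style = icon_style(0.5, "mixed")
```

A linear size scale is the simplest choice; an embodiment could equally use bucketed sizes if small differences in authority should not be visible.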
[0043] To populate the map 400, a criteria panel (not shown), such as a pull-down menu, for example, may be used to select the topic of interest. The interface 401 allows a user to get additional information about any of the nodes (icons associated with authors, sites, and topics) on the display 401. For example, and referring to FIGURE 5, by left clicking on a node, a small pop-up window 500 with additional detail about that node will appear. The display allows one to promote or "pin" nodes that are of interest, which makes those items larger on the screen. Items may be pinned by clicking on the upper right hand side of the node icon.
[0044] Further included within an embodiment of the authority map is a series of calculations. For example, in an embodiment, the magnitude of author authority may be calculated based on data representing the topic selected by the user, using the conversations between authors and the activity generated by the commentary of a particular author (e.g., the number of comments posted in response to a comment by the author) to evaluate the author's authority. This data may be calculated or otherwise determined computationally/automatically (i.e., by execution of computer-executable instructions), by human analysis, or some combination of both types of approaches.
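One way to read the example in this paragraph is that an author's authority grows with the number of comments other authors post in response to that author. The Python sketch below illustrates that reading only. The post fields (`id`, `author`, `reply_to`), the exclusion of self-replies, and the normalization against the most-responded-to author are assumptions, not details given in the text.

```python
from collections import Counter
from typing import Dict, List


def response_counts(posts: List[dict]) -> Dict[str, int]:
    """Count, per author, the comments other authors posted in response."""
    author_of = {p["id"]: p["author"] for p in posts}
    counts: Counter = Counter()
    for p in posts:
        parent = p.get("reply_to")
        # Skip original posts, unknown parents, and self-replies (assumption).
        if parent in author_of and author_of[parent] != p["author"]:
            counts[author_of[parent]] += 1
    return dict(counts)


def authority_magnitude(counts: Dict[str, int]) -> Dict[str, float]:
    """Scale response counts to [0, 1] relative to the top author."""
    top = max(counts.values(), default=0)
    return {a: c / top for a, c in counts.items()} if top else {}


posts = [
    {"id": 1, "author": "ann", "reply_to": None},
    {"id": 2, "author": "bob", "reply_to": 1},
    {"id": 3, "author": "cy", "reply_to": 1},
    {"id": 4, "author": "ann", "reply_to": 2},
]
magnitudes = authority_magnitude(response_counts(posts))
```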
[0045] The magnitude of site authority may be defined or otherwise determined in a manner similar to that used to determine the magnitude of author authority. Data representing content pertaining to a particular topic may be determined to have been written or otherwise produced by someone at a site. As such, sites having associated therewith a predetermined threshold number of comments pertaining to a particular topic may be determined to be an authoritative site. The magnitude of the authority of these sites may then be determined based on, for example, the amount or volume of comment pertaining to the topic in question and associated with each respective site. This data may be calculated or otherwise determined computationally/automatically (i.e., by execution of computer-executable instructions), by human analysis, or some combination of both types of approaches.
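The threshold test and volume-based magnitude for sites can be sketched in a few lines of Python. This is an illustration under assumptions: the post fields (`site`, `topics`), the function name `authoritative_sites`, and the use of the raw on-topic comment count as the magnitude are hypothetical choices, and the text does not state a threshold value.

```python
from collections import Counter
from typing import Dict, List


def authoritative_sites(posts: List[dict], topic: str,
                        threshold: int) -> Dict[str, int]:
    """Return sites whose on-topic comment volume meets the threshold.

    The value for each site is its on-topic volume, used here as the
    magnitude of its authority (an assumption for illustration).
    """
    volume = Counter(p["site"] for p in posts if topic in p["topics"])
    return {site: n for site, n in volume.items() if n >= threshold}


posts = [
    {"site": "forum.example", "topics": {"phones"}},
    {"site": "forum.example", "topics": {"phones", "tablets"}},
    {"site": "forum.example", "topics": {"tablets"}},
    {"site": "blog.example", "topics": {"phones"}},
]
sites = authoritative_sites(posts, "phones", threshold=2)
```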
[0046] Sentiment may be calculated by a weighted metric on the overall sentiment distribution, which favors "sentimented" values over neutral values four to one. This ensures that a user is seeing which way an author leans when writing on a topic. Counts and totals are reflective of the on-topic conversations based on the topic of interest chosen; if an author has written 200 posts, but only 5 are about the topic you're researching, the calculations will only leverage the 5 within the calculation. The result is that the user can set the context in order to identify authorities in relation to that context.
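The four-to-one weighting can be illustrated with a short Python sketch. The text gives only the ratio, so the rest is an assumed reading: each positive, negative, or mixed label contributes a weight of 4, each neutral label a weight of 1, the label with the largest weighted total wins, and a tie between neutral and a sentimented label goes to the sentimented one. The names `WEIGHTS` and `dominant_sentiment` are hypothetical.

```python
from collections import Counter
from typing import List

# Sentimented labels outweigh neutral four to one, per the text.
WEIGHTS = {"positive": 4, "negative": 4, "mixed": 4, "neutral": 1}


def dominant_sentiment(labels: List[str]) -> str:
    """Pick the label with the largest weighted total across on-topic posts."""
    if not labels:
        return "neutral"  # assumption: no on-topic posts reads as neutral
    scores: Counter = Counter()
    for label in labels:
        scores[label] += WEIGHTS[label]
    # On an exact tie with neutral, prefer the sentimented label (assumption).
    return max(scores, key=lambda k: (scores[k], k != "neutral"))


leaning = dominant_sentiment(["neutral", "neutral", "neutral", "positive"])
```

With this weighting, one positive post outweighs three neutral ones, which matches the stated goal of showing which way an author leans.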
[0047] Further included within an embodiment of the authority map is a series of calculations. As raw data comes in from collection, the data is processed and analyzed in several ways. Each unique post or comment is first matched to one or more topics of interest leveraging term-based definitions. For each topic matched, a sentiment is assigned using either manual attribution or computational attribution. Computational attribution of sentiment is achieved using technology that correlates patterns between a set of known pieces of content that represent the sentiment for a topic to the individual piece of content being analyzed. For example, an embodiment uses text parsing in conjunction with Bayesian inference in order to assign a probability that a post exists within each of a neutral or sentimented "states." Each state is represented by a definition derived from groups of posts that are characteristic of that state. The comparison is done using the state definitions that are stored in an index resident on the client device 210 and/or server 230 and/or database 240 and comparing that state definition with the content in question. Alternatively, or additionally, an embodiment uses keyword/keyphrase/keysentence recognition in conjunction with an index, for example, that correlates a sentiment value with a particular or group of keyword/keyphrase/keysentence to determine an author's opinion on a topic.
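The Bayesian step can be illustrated with a small naive Bayes classifier in Python. This is a sketch of one common technique that fits the description, not necessarily the method used in the embodiment: the tokenizer, the add-one (Laplace) smoothing, the uniform prior over states, the function names, and the example posts are all assumptions. Each "state definition" is modeled here as a table of token counts built from posts characteristic of that state.

```python
import math
import re
from collections import Counter
from typing import Dict, List


def tokenize(text: str) -> List[str]:
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z']+", text.lower())


def build_state_definitions(examples: Dict[str, List[str]]) -> Dict[str, Counter]:
    """Derive one token-count table per state from characteristic posts."""
    return {
        state: Counter(tok for post in posts for tok in tokenize(post))
        for state, posts in examples.items()
    }


def state_probabilities(text: str, definitions: Dict[str, Counter]) -> Dict[str, float]:
    """Return the probability that the text belongs to each state."""
    vocab = set().union(*definitions.values())
    log_scores = {}
    for state, counts in definitions.items():
        total = sum(counts.values())
        score = 0.0  # uniform prior over states (assumption)
        for tok in tokenize(text):
            # Add-one smoothing keeps unseen tokens from zeroing the product.
            score += math.log((counts[tok] + 1) / (total + len(vocab)))
        log_scores[state] = score
    peak = max(log_scores.values())
    scaled = {s: math.exp(v - peak) for s, v in log_scores.items()}
    norm = sum(scaled.values())
    return {s: v / norm for s, v in scaled.items()}


definitions = build_state_definitions({
    "sentimented": ["I love this phone", "I hate the battery", "terrible awful screen"],
    "neutral": ["the phone ships in june", "the battery is 3000 mah"],
})
probs = state_probabilities("I love the screen", definitions)
```

Working in log space and subtracting the peak before exponentiating avoids underflow on long posts while leaving the normalized probabilities unchanged.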
[0048] When displaying an author or site's sentiment in the Authority Map, the dominant sentiment is calculated by a weighted metric on the overall sentiment distribution across all posts that match the topic being analyzed, weighting "sentimented" values over neutral values in a 4:1 ratio. For authors, the posts not only match the topic, but have also been written by the author of interest. For sites, the posts not only match the topic, but have also been written at the site of interest. Authority is then calculated based on the data representing the topic selected by the user, using the conversations between authors and the activity (post counts) to evaluate the author's (or site's) Authority. Therefore, calculations are reflective of the on-topic conversations, computed relative to the topic ecosystem being analyzed; if an author has written 200 posts, but only 5 are about the topic you're researching, the calculations will only leverage the 5 within the calculation. The result is that the user can set the context in order to identify authorities in relation to that context.
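The on-topic restriction in this paragraph (200 posts written, only 5 counted) can be sketched as a filter followed by a roll-up per author and per site. The Python below is illustrative only; the post fields (`author`, `site`, `topics`, `sentiment`), the function name `on_topic_rollup`, and the sample data are assumptions.

```python
from collections import defaultdict
from typing import Dict, List, Tuple


def on_topic_rollup(posts: List[dict], topic: str) -> Tuple[Dict[str, list], Dict[str, list]]:
    """Group the sentiment labels of on-topic posts by author and by site.

    Posts that do not match the topic are ignored entirely, so every later
    count or sentiment calculation is relative to the selected topic.
    """
    by_author = defaultdict(list)
    by_site = defaultdict(list)
    for p in posts:
        if topic in p["topics"]:
            by_author[p["author"]].append(p["sentiment"])
            by_site[p["site"]].append(p["sentiment"])
    return dict(by_author), dict(by_site)


# An author with 200 posts, of which only 5 concern the topic of interest.
posts = [
    {"author": "ann", "site": "blog.example", "topics": {"cooking"}, "sentiment": "neutral"}
    for _ in range(195)
]
posts += [
    {"author": "ann", "site": "blog.example", "topics": {"phones"}, "sentiment": "positive"}
    for _ in range(5)
]
by_author, by_site = on_topic_rollup(posts, "phones")
```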
[0049] Referring to FIGURES 6-9, embodiments of an authority map include but are not limited to the following features:
[0050] Single topic representation with a topic selector for context
[0051] Color-coded sentiment visualization rolled up to Authors and Sites
[0052] Authority represented by icon size
[0053] Topic-Site linkage
[0054] Site-Author linkage
[0055] Author-Author linkage
[0056] Mouse-over tool tip with data stats
[0057] Alternative embodiments may include:
[0058] • Sliding scale to allow user to choose the number of authors displayed
[0059] • Date and Site Domain Filters
[0060] • Data Drill down capabilities that allow users to view the data behind the calculations
[0061] • 3 different authority calculations
[0062] o Activity (Overall volumes of content)
[0063] o Pull (Unique Inbound Authors)
[0064] Inbound authors are those that comment on a given author's original post
[0065] o Reach (Unique Outbound Authors)
[0066] Outbound authors are those that a given author has commented on
[0067] • Mini map navigation tool
[0068] o Zoom navigation
[0069] o Landscape panning
[0070] • Graph versus List View
[0071] • 3 new authority calculations
[0072] o Authorship (Volume of Original Posts)
[0073] o Participation (Volume of Commentary)
[0074] o Influence (Weighted metric of Activity, Pull and Reach)
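The authority calculations listed above can be computed together from a list of posts and comments. The Python sketch below is one possible reading, offered under assumptions: every comment's `reply_to` field is taken to reference an original post, self-comments are ignored for Pull and Reach, and Influence is an equally weighted sum because the text does not state the weights. The field names and the function name `authority_metrics` are hypothetical.

```python
from collections import defaultdict
from typing import Dict, List, Tuple


def authority_metrics(posts: List[dict],
                      weights: Tuple[float, float, float] = (1.0, 1.0, 1.0)) -> Dict[str, dict]:
    """Compute Authorship, Participation, Activity, Pull, Reach and Influence."""
    author_of = {p["id"]: p["author"] for p in posts}
    raw = defaultdict(lambda: {"authorship": 0, "participation": 0,
                               "inbound": set(), "outbound": set()})
    for p in posts:
        author, parent = p["author"], p.get("reply_to")
        if parent is None:
            raw[author]["authorship"] += 1  # original post
            continue
        raw[author]["participation"] += 1   # commentary
        target = author_of.get(parent)
        if target is not None and target != author:
            raw[target]["inbound"].add(author)   # commenter is inbound to target
            raw[author]["outbound"].add(target)  # target is outbound from commenter
    w_activity, w_pull, w_reach = weights
    result = {}
    for author, v in raw.items():
        activity = v["authorship"] + v["participation"]
        pull, reach = len(v["inbound"]), len(v["outbound"])
        result[author] = {
            "authorship": v["authorship"],
            "participation": v["participation"],
            "activity": activity,
            "pull": pull,
            "reach": reach,
            # Weighted metric of Activity, Pull and Reach (weights assumed).
            "influence": w_activity * activity + w_pull * pull + w_reach * reach,
        }
    return result


posts = [
    {"id": 1, "author": "ann", "reply_to": None},
    {"id": 2, "author": "bob", "reply_to": 1},
    {"id": 3, "author": "cy", "reply_to": 1},
    {"id": 4, "author": "bob", "reply_to": 1},
    {"id": 5, "author": "bob", "reply_to": None},
    {"id": 6, "author": "ann", "reply_to": 5},
]
metrics = authority_metrics(posts)
```

Using sets for inbound and outbound authors is what makes Pull and Reach count unique authors: bob comments on ann's post twice in the sample data but contributes only once to her Pull.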
[0075] While the preferred embodiment of the invention has been illustrated and described, as noted above, many changes can be made without departing from the spirit and scope of the invention. Accordingly, the scope of the invention is not limited by the disclosure of the preferred embodiment. Instead, the invention should be determined entirely by reference to the claims that follow.

Claims

What is claimed is:
1. A method implementable in at least one electronic device coupled to a network and a display device, comprising the steps of: receiving, over the network, a data set; generating to the display device a graphical user interface (GUI) including a menu of topics selectable by a user of the GUI; receiving, from the user, a selection of a first topic of the menu; determining, based on the data set, a plurality of network sites hosting commentary of the first topic and an authority level of each site of the plurality; determining, based on the data set, an authority level of each site of the plurality; determining, based on the data set, a plurality of authors providing the commentary hosted by the plurality of network sites; determining, based on the data set, an authority level of each author of the plurality; determining, based on the data set, a value characterizing an opinion of each author on the first topic; and in response to the user selection, generating within the GUI a set of icons representing the plurality of sites and the plurality of authors, the icons being presented in multiple presentation formats based on the determined authority levels and opinion values.
PCT/US2008/073401 2006-05-05 2008-08-15 Consumer-generated media influence and sentiment determination WO2009023865A1 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
PCT/US2009/061038 WO2010065199A1 (en) 2008-08-15 2009-10-16 Systems and methods for consumer-generated media reputation management
US12/580,667 US9269068B2 (en) 2006-05-05 2009-10-16 Systems and methods for consumer-generated media reputation management
US14/881,037 US20160217488A1 (en) 2007-05-07 2015-10-12 Systems and methods for consumer-generated media reputation management

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US95609707P 2007-08-15 2007-08-15
US96506707P 2007-08-15 2007-08-15
US60/965,067 2007-08-15
US60/956,097 2007-08-15

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US11/745,390 Continuation-In-Part US7720835B2 (en) 2006-05-05 2007-05-07 Systems and methods for consumer-generated media reputation management

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US12/192,919 Continuation-In-Part US20090070683A1 (en) 2006-05-05 2008-08-15 Consumer-generated media influence and sentiment determination

Publications (1)

Publication Number Publication Date
WO2009023865A1 true WO2009023865A1 (en) 2009-02-19

Family

ID=40351197

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2008/073401 WO2009023865A1 (en) 2006-05-05 2008-08-15 Consumer-generated media influence and sentiment determination

Country Status (1)

Country Link
WO (1) WO2009023865A1 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2013003961A2 (en) * 2011-07-07 2013-01-10 International Business Machines Corporation System and method for determining interpersonal relationship influence information using textual content from interpersonal interactions
EP2560111A3 (en) * 2011-08-15 2013-05-15 Lockheed Martin Corporation Systems and methods for facilitating the gathering of open source intelligence
US8620849B2 (en) 2010-03-10 2013-12-31 Lockheed Martin Corporation Systems and methods for facilitating open source intelligence gathering
US9418389B2 (en) 2012-05-07 2016-08-16 Nasdaq, Inc. Social intelligence architecture using social media message queues
US10304036B2 (en) 2012-05-07 2019-05-28 Nasdaq, Inc. Social media profiling for one or more authors using one or more social media platforms

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060004691A1 (en) * 2004-06-30 2006-01-05 Technorati Inc. Ecosystem method of aggregation and search and related techniques
US20060287989A1 (en) * 2005-06-16 2006-12-21 Natalie Glance Extracting structured data from weblogs
US20070100875A1 (en) * 2005-11-03 2007-05-03 Nec Laboratories America, Inc. Systems and methods for trend extraction and analysis of dynamic data

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8935197B2 (en) 2010-03-10 2015-01-13 Lockheed Martin Corporation Systems and methods for facilitating open source intelligence gathering
US8620849B2 (en) 2010-03-10 2013-12-31 Lockheed Martin Corporation Systems and methods for facilitating open source intelligence gathering
US9348934B2 (en) 2010-03-10 2016-05-24 Lockheed Martin Corporation Systems and methods for facilitating open source intelligence gathering
GB2507215A (en) * 2011-07-07 2014-04-23 Ibm System and method for determining interpersonal relationship influence information using textual content from interpersonal interactions
WO2013003961A3 (en) * 2011-07-07 2013-05-02 International Business Machines Corporation System and method for determining interpersonal relationship influence information using textual content from interpersonal interactions
WO2013003961A2 (en) * 2011-07-07 2013-01-10 International Business Machines Corporation System and method for determining interpersonal relationship influence information using textual content from interpersonal interactions
US10235421B2 (en) 2011-08-15 2019-03-19 Lockheed Martin Corporation Systems and methods for facilitating the gathering of open source intelligence
EP2560111A3 (en) * 2011-08-15 2013-05-15 Lockheed Martin Corporation Systems and methods for facilitating the gathering of open source intelligence
US8650198B2 (en) 2011-08-15 2014-02-11 Lockheed Martin Corporation Systems and methods for facilitating the gathering of open source intelligence
US9418389B2 (en) 2012-05-07 2016-08-16 Nasdaq, Inc. Social intelligence architecture using social media message queues
US10304036B2 (en) 2012-05-07 2019-05-28 Nasdaq, Inc. Social media profiling for one or more authors using one or more social media platforms
US11086885B2 (en) 2012-05-07 2021-08-10 Nasdaq, Inc. Social intelligence architecture using social media message queues
US11100466B2 (en) 2012-05-07 2021-08-24 Nasdaq, Inc. Social media profiling for one or more authors using one or more social media platforms
US11803557B2 (en) 2012-05-07 2023-10-31 Nasdaq, Inc. Social intelligence architecture using social media message queues
US11847612B2 (en) 2012-05-07 2023-12-19 Nasdaq, Inc. Social media profiling for one or more authors using one or more social media platforms

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 08798044

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 17/06/2010)

122 Ep: pct application non-entry in european phase

Ref document number: 08798044

Country of ref document: EP

Kind code of ref document: A1