WO2009082100A9 - Method and system for searching information of collective emotion based on comments about contents on internet - Google Patents

Method and system for searching information of collective emotion based on comments about contents on internet Download PDF

Info

Publication number
WO2009082100A9
WO2009082100A9 PCT/KR2008/007228 KR2008007228W WO2009082100A9 WO 2009082100 A9 WO2009082100 A9 WO 2009082100A9 KR 2008007228 W KR2008007228 W KR 2008007228W WO 2009082100 A9 WO2009082100 A9 WO 2009082100A9
Authority
WO
WIPO (PCT)
Prior art keywords
content
impression
emotional
score
search
Prior art date
Application number
PCT/KR2008/007228
Other languages
French (fr)
Other versions
WO2009082100A2 (en
WO2009082100A4 (en
WO2009082100A3 (en
Inventor
Soung-Joo Han
Original Assignee
Soung-Joo Han
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Soung-Joo Han filed Critical Soung-Joo Han
Priority to US12/679,011 priority Critical patent/US20100262597A1/en
Publication of WO2009082100A2 publication Critical patent/WO2009082100A2/en
Publication of WO2009082100A3 publication Critical patent/WO2009082100A3/en
Publication of WO2009082100A4 publication Critical patent/WO2009082100A4/en
Publication of WO2009082100A9 publication Critical patent/WO2009082100A9/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31Indexing; Data structures therefor; Storage structures
    • G06F16/313Selection or weighting of terms for indexing

Definitions

  • the present invention relates to information search method and system using a
  • search method and system that provide a list of content, which is sorted by a proper ranking
  • search engine has merely provided content with text information (e.g.,
  • An object of the present invention is to introduce information search method
  • the system collects comments about content on the Internet
  • the invention produces information search method
  • feeling about content is similar to another's feeling about it. For example, when one feels a
  • the invention aggressively makes use of comments posted by users that appreciated content.
  • the invention constructs a database from the information. Using the database, it provides a method and system for retrieving information matching a query including an emotional word in order to solve the problem.
  • the present invention provides a method for searching information of collective emotion, which comprises the following processes.
  • a main server in the system constructs a search database in which impression score tables are stored (SlOl), where each row of an impression score table consists of two fields: one is the name of an item in which emotional words are categorized, and the other is its value (See FIG. 7).
  • the server receives a search query from a user (S 102).
  • the server separates and extracts non-emotional word(s) and emotional ones(s) from the transferred query (S 103).
  • the server finds content relevant to the non-emotional words in the search database (S 104). In this step, if there is no non-emotional word in the query, step S 104 may be omitted.
  • the server finds out which of impression classes in an impression classification table the extracted emotional word(s) belongs to (S 105).
  • the server determines whether an item, which matches the found impression class in S 105, in each impression score table of the content, which has been found in step S 104, is checked or a score is assigned in the item (S 106).
  • the server adjusts the ranks of the found content according to a predetermined method dependent on the "checked" values or the scores (S 107).
  • the method of adjusting the rank will be explained later in exemplary embodiments.
  • the server makes the user's terminal display the adjusted search result (S108).
  • Step SlOl comprises the following sub-steps.
  • the server collects documents with comments on the Internet (S201); the server extracts comments from the collected documents (S202). More particularly, the server collects documents with comments by using a web robot that automatically selects and collects fit information from web documents on the Internet, and extracts the comments from the collected documents.
  • the server searches extracted comments for emotional words (S203). More particularly, the server separates and extracts emotional words (or phrases) from the extracted comments by using processing such as morphological analysis and word stemming. After that, the server finds out which of impression classes in an impression classification table each of the found emotional words of the content belongs to (S204). Then, the server checks corresponding items in the impression score tables of the content or assigns scores to them (S205).
  • the impression classification table (See FIG. 3) means a table in which emotional words are classified and itemized. For example, the impression classification table in FIG. 3 shows that the emotional word “angry” belongs to the impression item "pleasant/angry.”
  • the names of items in the table may be set to a diversity of adjectives (or adverbs). or instance, the names are set into "glad, angry, sorrowful, pleasant, delicious, hateful, desirable, beautiful, ugly, good and nicely.”
  • the classification method is not fixed. On the contrary, it may be changed.
  • items in the table can be classified either briefly or in detail. For example, “lovely” and “cute” are put into the same category.
  • a score may be assigned to an item in the table as well as the item can be checked.
  • scores in the items in the table may be assigned according to the number of emotional comments (or the number of users that posted the comments) and the intensity of feelings. Methods of assigning the scores can be as follows.
  • the score may be adjusted by users' recommending (or assenting to) or dissenting from comments. Or, it may be adjusted by intensity of a feeling that is computed according to users' rating content, not text-based comment.
  • feelings of a kind and the opposite feelings may be categorized into the same item. And then words related to the opposite feelings decrease the score field of the item. For example, “joyful” and “sorrowful” are opposite to each other but distinguished from other feelings. Thus, they can be categorized into an item; emotional words related to "joyful” increase the score of the item. And emotional words related to "sorrowful" decrease the score.
  • a score according to the word may be assigned to plural items in an impression score table.
  • the emotional word “magnificent” means both “grand” and “gorgeous.” Therefore a score of "magnificent” is assigned to two items to which "grand" and "gorgeous” belong.
  • the server stores information about the content and the impression score tables, or metadata thereof (see FIG. 7) into the search database (S206). Thereby constructing the database (SlOl) has been finished.
  • the above information about content includes index terms, the URL of a webpage containing the content, the URL of the content, ranking number(s) related to the content and so on, as shown in FIG. 6.
  • the following is illustrating constructing the above-stated database (SlOl).
  • SaOl above-stated database
  • the server stores the impression score table of the content, information about it (e.g., URI, URL, condensed information or content itself) and information about documents related to it (e.g., text in the webpage) into the database, where an item named "pretty” in the impression score table is checked.
  • content and words (or phrases) in documents related to them e.g., web pages
  • expected phrases to combine emotional words (or phrases) and non-emotional words (or phrases) may be indexed or ranked in advance.
  • comments about content may be considered as a part of the document.
  • the indexing (strictly speaking, inverted indexing) and ranking for the search engine may be processed according to the present invention or other search methods.
  • objects to be indexed include words (, word groups or phrases) in content or documents, but not limited thereto. Thus, comments (including emotional words and non-emotional ones) attached to content or documents may be indexed.
  • step S 102 the server receives a search query from a user. More particularly, a user sends a search query including an emotional word to the server using the user's terminal.
  • step S 103 the server separates and extracts emotional word(s) and non-emotional one(s) from the transmitted query. More particularly, the server separates and extracts emotional word(s) and non-emotional one(s) by using processing such as morphological analysis and word stemming. If only an emotional word is in the query, it is self-evident that only the emotional word will be separated and extracted.
  • step S 104 the server finds content relevant to the extracted non-emotional word(s) in the search database. More particularly, the server finds an index term that matches the non-emotional word(s) in the database and then a list of content to which the index term points is found in the database.
  • FIG. 6 shows that if a separated non-emotional word (or phrase) is "dance music," web pages A and B where the phrase occurs are found.
  • step S 105 the server finds out which of impression classes in an impression classification table the emotional word(s) belongs to, where the emotional word has been separated from the search query in step S 103.
  • FIG. 3 shows that if the separated emotional word is "boring,” it belongs to the item "interesting/boring" in the table.
  • step S 106 the server determines whether an item, which matches the found impression class, in each impression score table of the content found in step S 104 has been checked. To put it in another way, it looks up an item, in the impression score table, corresponding to the item in the impression classification table, which has been set according to the emotional word(s); it examines the value of the very item of each impression score table of the content, which has been found according to the non-emotional words. Also in the case where a score is assigned to the item, the process is the same as the above-stated that. However, if there is no non-emotional word in the search query, step S 104 will be omitted and the server finds all content whose the corresponding items are checked or have scores.
  • step S 107 the server adjusts the ranks of the found content according to the "checked/unchecked" values of the matched items.
  • the ranks of them are thus adjusted.
  • the ranks of the above-stated process are adjusted according to the score. The following are illustrating the rank adjusting methods.
  • the server When a user entered the search query "cute baby photo,” firstly the server finds content relevant to the non-emotional word (or phrase) "baby photo.” Then, the server raises the ranks of content whose the "cute” items, in the impression score tables, or metadata thereof, have been checked in the found content.
  • the ranking result may not be adjusted according to the emotional word(s) (or phrase(s)) after the content relevant to the non-emotional word(s) (or phrase(s)) is found.
  • the result may be adjusted according to the relationship with the content and the non-emotional word (or phrase) after the content relevant to the emotional word(s)(or phrase) is found.
  • indexes of the emotional words (or phrase) and non-emotional ones (or phrase) may be built in a matrix structure for use.
  • intensity of a feeling of an emotional search query may influence a ranking result.
  • the search result is sorted simply in descending order of scores of the "gloomy” item.
  • content having the "gloomy" score corresponding to "little” may be ranked more highly.
  • the above idea may be implemented as follows. On condition that there is an adverb to express intensity of a feeling in a search query, a score of the adverb is set.
  • the adverbs "very,” “fairly,” “somewhat,” “rarely,” “scarcely” and “never” are respectively set to 10, 7, 5, 3, 1 and 0.
  • pieces of content that have impression scores (approximately) corresponding to the score of the intensity of the feeling are ranked more highly. For example, suppose that the scores of the "gloomy" items in the impression score tables of web pages A and B are respectively 8 and 10; when a user enters a search query including the emotional words "fairly gloomy,” the adverb "fairly” is set to 7 according to the above instance. Because the score of A is more approximate to the score 7 than that of B, A is ranked higher than B.
  • the above-stated idea may be considered an analog search method.
  • the server makes the user's terminal display the adjusted search result (obtained through the step S 107).
  • the displayed result may have a variety of representation.
  • scores of the impression items about content are clearly visualized to a user. More specifically, a score of each impression item is represented in the form of a bar graph. Additionally, a trend of an impression score about content may be clearly displayed. More specifically, a change of an impression score about content can be displayed in a line chart.
  • content and the impression score tables, or metadata thereof may be well structured so that they are easily accessed, read and browsed. More specifically, the data can be structured in the form of directories or a matrix so that it is displayed in a user's terminal.
  • step S 104 is omitted. Then, any piece of content whose an impression item corresponding to the query is checked may get a high rank.
  • the system includes web servers 901 ; a main server for the system 910; a user's terminal 930; a database for an impression classification table 903 and a search database 904.
  • the main server 910 gets web documents with comments through the telecommunications network 902 from the web servers 901.
  • the device 930 is used to enter a search query including emotional word(s). It is a terminal of a PC, a mobile phone, a PDA (Personal digital assistant) or any other device. It is linked to the main server 910 across the telecommunications network 902. A user gets a search result in response to a query including an emotional word using the terminal 930.
  • the main server 910 is managed by a search provider.
  • the server stores the database for the impression classification table 903 and the search database 904; it controls and manages steps for searching information of collective information based on comments about content.
  • the search provider sends a search result, which is sorted by a proper ranking, back to the terminal of the user who entered a query including an emotional word, as well as managing the main server 910.
  • the main server 910 includes the following modules: a document collecting module 911, a comment extracting module 912, an emotional-word finding module 913, an impression-class looking-up module 914, an impression-item checking module 915, a database storing module 916, a data transferring module 917, a content finding module 918, a rank adjusting module 919 and a result handling module 920.
  • the module 911 collects documents to construct the search database from the web servers 901 by using a web robot or any other method.
  • the module 912 separates and extracts comments from the documents collected by the module 911.
  • the module 913 finds, separates and extracts emotional word(s) in comments on content or in a search query including emotional word(s).
  • the module 914 looks up an impression class to which the extracted emotional word(s) belong in the database for the impression classification table.
  • the module 915 checks a matched item, in the impression score table, set by the module 914 or assigns a score to the item.
  • the module finds out whether an impression item, which is corresponding to the impression class of the search query, is checked or a score is assigned to the item.
  • the module 916 stores information about the content and the impression score table, or metadata thereof into the search database.
  • the module 917 receives a search query from the user's terminal 930.
  • the module 914 looks up an impression class to which the extracted emotional word(s) belong in the database for the impression classification table.
  • the module 915 checks a matched item, in the impression score table, set by the module 914 or assigns a score to the item.
  • the module finds out whether an impression item
  • the module 918 finds content relevant to the non-emotional word(s) in the search query, in the search database. In the content found by the module 918, if one or more of their impression items corresponding to an impression class set by the module 915 are checked, the checked pieces of content are considered highly relevant to the query. Thus the module 919 adjusts the
  • the module 920 makes the user's terminal 930 display the
  • the main server 910 stores the database 903 (see FIG. 9). An impression
  • the main server 910 has the database 904 (see
  • FIG. 9 that stores information about content and the impression score tables, or metadata
  • FIG. 1 is a flow chart illustrating process for finding information in response to a
  • search query including an emotional word
  • FIG. 2 is a flow chart illustrating process for constructing the search database
  • FIG. 3 presents an impression classification table stored in a database
  • FIG. 4 is a view illustrating exemplary HTML files to link dance music content
  • FIG. 5 is a view illustrating impressions and reviews, in comment sections, posted
  • FIG. 6 is a view illustrating an inverted index created by indexing the documents
  • FIG. 7 is an exemplary view illustrating records comprising the URLs of content
  • FIG. 8 is an exemplary view illustrating relationship between the records
  • FIG. 9 is a general view illustrating the system for searching information of
  • search query including an emotional word is retrieved, as shown in a flow chart of FIG. 1.
  • a search database is constructed, as
  • the search database should be constructed in advance (SlOl in FIG. 1), which will be explained in detail in the second embodiment later.
  • the main server When a user enters the search query "fun dance music” (S 102), the main server receives the query and then separates/extracts the emotional word “fun” and the non-emotional phrase "dance music” (S 103).
  • the server finds a list of information about documents/content relevant to the index term "dance music" in the search database (S 104).
  • a list of information about documents/content relevant to the index term "dance music" in the search database S 104.
  • an item to indicate webpage A and that to indicate B are stored in the order as shown in FIG. 6.
  • each record in 811, in the search database includes an impression score table and content's URL (which is a key field).
  • the server finds such a record (in 801) whose the content's URL field matches the content's URL field of the record related to webpage A/B (see 801 and 802 in FIG. 8).
  • the server finds out which of impression classes the emotional word "fun” belongs to in the impression classification table (see FIG. 3) (S 105). As shown in the table, “fun” belongs to the item “merry/gloomy” and it has a positive score.
  • the server examines whether a score is assigned to the "merry/gloomy" item of each impression score table of dance music A and B in the search database (S 106).
  • the server finds out the numbers 0 and +3 are assigned to "merry/gloomy" item of A and B respectively in FIG. 7.
  • the server adjusts a ranking number given to each web page on the basis of the scores taken as above (S 107).
  • webpage B is ranked higher than A.
  • the server finds out the numbers -2 and 0 are assigned to the "interesting/boring" item of dance music A and B, respectively.
  • the server reverses the sign of an impression score of the emotional word before subtracting each impression score from a ranking number.
  • A is ranked higher than B; the server makes the user's terminal display the search result (S 108).
  • Second embodiment is as follows.
  • a search provider previously creates a database for an impression classification table in which a variety of emotional words are classified as shown in FIG. 3. For example, "interesting” and “boring” are opposite to each other but distinguished from other feelings. Thus, one impression class item called "interesting/boring” is set.
  • the words “tedious” and “boring” are classified into the impression class "interesting/boring.” Accordingly, when the word “tedious” is included in the comment, a negative score is assigned to an item of the impression class "interesting/boring.” In addition, the words “fun,” “merry” and “cheerful” belong to the impression class "merry/gloomy” and a positive score is assigned to an item of the class.
  • An administrator of a website or a normal user uploads two HTML files to link dance music content on the website as shown in FIG. 4.
  • Content 401 and 402 are the sources of the two.
  • the anchor text in content 401 contains "dance music A” and 402 "dance music B" to describe the content.
  • Each of the two links refers to related content.
  • the two are displayed, like web pages 403 and 404, to human users. Users visit the web site and appreciate the dance music linked to the web pages.
  • the server collects the web pages with the comments as shown in FIG.
  • the collected documents may be indexed and ranked.
  • the server stores the index term "dance music" with the URLs of the web pages related thereto, the URLs of relevant content, etc into the search database 904. Additionally, ranking numbers are stored along with them.
  • the rankings numbers may be computed according to the present invention, or may be done by other algorithms irrespective of the invention. In the embodiment, regardless of an emotional word, webpage A got the ranking number 1 and B the number 2 (the lower the ranking number is, the higher the rank is).
  • webpage A is ranked higher than B in its response.
  • the server analyzes impressions in the comments about dance music A and B; it classifies them.
  • the result is stored in the search database.
  • Each of the stored records includes content's URL field and items of impression scores of the content (701 in FIG. 7), where the content's URL field is the key of the record.
  • a unique identification number is used as a document/content identifier.
  • content's URL is used as the identifier.
  • the server extracts comments from the collected documents
  • the server extracts two emotional words “boring” and “tedious” from the comments about dance music A. As shown in the impression classification table of FIG. 3, the two words belong to the impression item "interesting/boring.” The server assigns a negative score, which the two words get, to the item. Thus, the score -2 is assigned to the item "interesting/boring" of the corresponding record (702 in FIG. 7).
  • the server extracts the three emotional words “fun,” “merry” and “cheerful” from the comments about dance music B. As shown in the impression classification table, the three words belong to the impression item "merry/gloomy.” The server assigns a positive score, which the words get, to the item. Thus, the score +3 is assigned to the item "merry/gloomy" of the corresponding record (703 in FIG. 7). As described above, the server constructs the search database which stores the information about the content and the impression score tables thereof (S206).

Abstract

The present invention relates to information search method and system aggressively using comments written by users who have appreciated content. An object of the invention is to display a search result, which is sorted by a proper ranking, in response to a query including an emotional word. For that purpose, while a search database is constructed, firstly emotional words are extracted from the comments and categorized. Next, impressions, or metadata about content are organized from them. Finally the metadata and information about the content are stored. Afterwards, when a user enters a search query including an emotional word, firstly emotional words and non-emotional ones are extracted from the query. Next, content relevant to the non-emotional word is found. Lastly, a ranking result is adjusted according to 'checked/unchecked' values (or scores) of an impression item, which matches the emotional word, of the found content.

Description

[DESCRIPTION]
[Invention Title]
METHOD AND SYSTEM FOR SEARCHING INFORMATION OF COLLECTIVE EMOTION BASED ON COMMENTS ABOUT CONTENTS ON INTERNET
[Technical Field]
The present invention relates to information search method and system using a
computer or telecommunications networks. More particularly, it relates to information
search method and system that provide a list of content, which is sorted by a proper ranking,
corresponding to a search query including an emotional word.
[Background Art]
When a user enters a search query including an emotional word (e.g., "beautiful
sea photo") into an Internet search engine, current search engines have not been able to
provide a high-quality search result, which is sorted by a proper ranking, corresponding to
the query. For that reason, the search service administrators have decided the ranking result
subjectively, or the search engine has merely provided content with text information (e.g.,
an image file name) that matches the emotional word included in the query. The above
conventional method has been very inefficient in that a few service administrators should manually edit search results from a great amount of content that is increasing quickly on the Internet. In addition, there has been a problem that such search results have low objectivity and reliability because they are decided subjectively by the few service administrators.
In the conventional art, when a search query including an emotional word is issued, the ranking result has been determined usually on the basis of information about documents containing images. In other words, the image file name, an anchor text to link the image file or information/title/text of a web site where the image is stored has been used. However, there has been a problem that the image file name, etc. do not frequently describe the image properly. On the other hand, there has been a trial that emotional information is extracted from bits to form the content of an image or video itself; a database of the extracted information is used for a search engine. However, this method has been doubtful to search for content relevant to a query representing complicated, delicate and esoteric emotion of human. Besides, the method has been too expensive to be practically used. Additionally, in the registered Korean Patent Publication No. 10-0462542 titled by
"Contents search system for providing confidential contents through network and method thereof," the number of comments posted by users about content is introduced as a factor to evaluate its reliability. However, in the method, the reliability is evaluated by using only the number of comments regardless of the content of the comments. Accordingly, the method
does not refer to the content of the comments and thus has not been able to provide a
high-quality result when a user enters a query including an emotional word.
[Disclosure]
[Technical Problem]
An object of the present invention is to introduce information search method and
system that can provide a result using objective and reliable ranking method in response to
a query including an emotional word. Moreover, the invention must be applicable in
industry. For that purpose, the system collects comments about content on the Internet,
constructs a search database using them and utilized it.
[Technical Solution]
To solve the technical problem, the invention produces information search method
and system based on the following two assumptions.
First, people have similar feelings about the same content. In other words, one's
feeling about content is similar to another's feeling about it. For example, when one feels a
certain photo is beautiful, another will also feel so. Second, the more users post their
comments (annotation, remark, user feedback, reply, review or suchlike) that describe their feelings about content, the more the sum or average of impressions in the comments is approximate to the intensity and the kind of a normal man's feeling about the content. In the present invention, the above assumptions are defined as collective emotion analogous to the concept collective intelligence that states that participation and collaboration of many individuals produce better intellectual results.
On the basis of the above assumptions, the invention aggressively makes use of comments posted by users that appreciated content. The invention constructs a database from the information. Using the database, it provides a method and system for retrieving information matching a query including an emotional word in order to solve the problem. The present invention provides a method for searching information of collective emotion, which comprises the following processes.
First, a main server in the system constructs a search database in which impression score tables are stored (SlOl), where each row of an impression score table consists of two fields: one is the name of an item in which emotional words are categorized, and the other is its value (See FIG. 7).
Then, the server receives a search query from a user (S 102). The server separates and extracts non-emotional word(s) and emotional ones(s) from the transferred query (S 103). Next, the server finds content relevant to the non-emotional words in the search database (S 104). In this step, if there is no non-emotional word in the query, step S 104 may be omitted.
Next, the server finds out which of impression classes in an impression classification table the extracted emotional word(s) belongs to (S 105).
Next, the server determines whether an item, which matches the found impression class in S 105, in each impression score table of the content, which has been found in step S 104, is checked or a score is assigned in the item (S 106).
Next, the server adjusts the ranks of the found content according to a predetermined method dependent on the "checked" values or the scores (S 107). The method of adjusting the rank will be explained later in exemplary embodiments.
Finally, the server makes the user's terminal display the adjusted search result (S108).
As described above, information of collective emotion based on comments about content is retrieved.
Hereinafter, each step of the search method will be explained in detail.
Step SlOl comprises the following sub-steps. The server collects documents with comments on the Internet (S201); the server extracts comments from the collected documents (S202). More particularly, the server collects documents with comments by using a web robot that automatically selects and collects fit information from web documents on the Internet, and extracts the comments from the collected documents.
Then, the server searches extracted comments for emotional words (S203). More particularly, the server separates and extracts emotional words (or phrases) from the extracted comments by using processing such as morphological analysis and word stemming. After that, the server finds out which of impression classes in an impression classification table each of the found emotional words of the content belongs to (S204). Then, the server checks corresponding items in the impression score tables of the content or assigns scores to them (S205).
The impression classification table (See FIG. 3) means a table in which emotional words are classified and itemized. For example, the impression classification table in FIG. 3 shows that the emotional word "angry" belongs to the impression item "pleasant/angry."
The names of items in the table may be set to a diversity of adjectives (or adverbs). or instance, the names are set into "glad, angry, sorrowful, pleasant, lovely, hateful, desirable, beautiful, ugly, good and nicely." The classification method is not fixed. On the contrary, it may be changed. Moreover items in the table can be classified either briefly or in detail. For example, "lovely" and "cute" are put into the same category.
As shown before, in step S205, a score may be assigned to an item in the table as well as the item can be checked.
In step S205, scores in the items in the table may be assigned according to the number of emotional comments (or the number of users that posted the comments) and the intensity of feelings. Methods of assigning the scores can be as follows.
First, suppose that there are news content A and B on a web site; users posted 10 comments in which the "good news" responses were written about A, and 3 comments where the "good news" responses were written were posted about B. In this case, a score of the "good" item in the impression score table of A is higher than that of B.
Second, a score of the word "delightful" is higher than that of "glad" because "delightful" means "very glad." In other words, the more intense an emotional word is, the higher score it gains.
Third, the score may be adjusted by users' recommending (or assenting to) or dissenting from comments. Or, it may be adjusted by intensity of a feeling that is computed according to users' rating content, not text-based comment. Fourth, when the emotional words are classified, feelings of a kind and the opposite feelings may be categorized into the same item. And then words related to the opposite feelings decrease the score field of the item. For example, "joyful" and "sorrowful" are opposite to each other but distinguished from other feelings. Thus, they can be categorized into an item; emotional words related to "joyful" increase the score of the item. And emotional words related to "sorrowful" decrease the score.
Fifth, when an emotional word represents complex feelings, a score according to the word may be assigned to plural items in an impression score table. As one example, the emotional word "magnificent" means both "grand" and "gorgeous." Therefore a score of "magnificent" is assigned to two items to which "grand" and "gorgeous" belong.
Sixth, authority, reputation or reliability of a user may influence an impression score of comment that the user posted.
As the next step of S205, the server stores information about the content and the impression score tables, or metadata thereof (see FIG. 7) into the search database (S206). Thereby constructing the database (SlOl) has been finished.
The above information about content includes index terms, the URL of a webpage containing the content, the URL of the content, ranking number(s) related to the content and so on, as shown in FIG. 6. The following is illustrating constructing the above-stated database (SlOl). Suppose that users appreciate a photo titled by "baby photo" in which babies are playing house in a web document; the users post two comments in which "pretty" and "cute" are written. Then the server collects the document and constructs the search database, the server separates and extracts emotional words (or phrases) from the comments about the photograph. The server stores the impression score table of the content, information about it (e.g., URI, URL, condensed information or content itself) and information about documents related to it (e.g., text in the webpage) into the database, where an item named "pretty" in the impression score table is checked. Additionally, content and words (or phrases) in documents related to them (e.g., web pages) may be indexed or ranked before/after constructing the database. Furthermore, expected phrases to combine emotional words (or phrases) and non-emotional words (or phrases) may be indexed or ranked in advance. Note that comments about content may be considered as a part of the document. The indexing (strictly speaking, inverted indexing) and ranking for the search engine may be processed according to the present invention or other search methods. In addition, objects to be indexed include words (, word groups or phrases) in content or documents, but not limited thereto. Thus, comments (including emotional words and non-emotional ones) attached to content or documents may be indexed.
In step S 102, the server receives a search query from a user. More particularly, a user sends a search query including an emotional word to the server using the user's terminal.
In step S 103, the server separates and extracts emotional word(s) and non-emotional one(s) from the transmitted query. More particularly, the server separates and extracts emotional word(s) and non-emotional one(s) by using processing such as morphological analysis and word stemming. If only an emotional word is in the query, it is self-evident that only the emotional word will be separated and extracted.
In step S 104, the server finds content relevant to the extracted non-emotional word(s) in the search database. More particularly, the server finds an index term that matches the non-emotional word(s) in the database and then a list of content to which the index term points is found in the database. FIG. 6 shows that if a separated non-emotional word (or phrase) is "dance music," web pages A and B where the phrase occurs are found.
In step S 105, the server finds out which of impression classes in an impression classification table the emotional word(s) belongs to, where the emotional word has been separated from the search query in step S 103. FIG. 3 shows that if the separated emotional word is "boring," it belongs to the item "interesting/boring" in the table.
In step S 106, the server determines whether an item, which matches the found impression class, in each impression score table of the content found in step S 104 has been checked. To put it in another way, it looks up an item, in the impression score table, corresponding to the item in the impression classification table, which has been set according to the emotional word(s); it examines the value of the very item of each impression score table of the content, which has been found according to the non-emotional words. Also in the case where a score is assigned to the item, the process is the same as the above-stated that. However, if there is no non-emotional word in the search query, step S 104 will be omitted and the server finds all content whose the corresponding items are checked or have scores.
In step S 107, the server adjusts the ranks of the found content according to the "checked/unchecked" values of the matched items. In other words, pieces of content that have "checked" values of the matched items in the found content (which is relevant to the non-emotional words) are considered highly relevant to the query, the ranks of them are thus adjusted. Of course, in the case where a score is assigned to the item, the ranks of the above-stated process are adjusted according to the score. The following are illustrating the rank adjusting methods.
When a user entered the search query "cute baby photo," firstly the server finds content relevant to the non-emotional word (or phrase) "baby photo." Then, the server raises the ranks of content whose the "cute" items, in the impression score tables, or metadata thereof, have been checked in the found content.
However, the ranking result may not be adjusted according to the emotional word(s) (or phrase(s)) after the content relevant to the non-emotional word(s) (or phrase(s)) is found. In other words, the result may be adjusted according to the relationship with the content and the non-emotional word (or phrase) after the content relevant to the emotional word(s)(or phrase) is found. Besides, indexes of the emotional words (or phrase) and non-emotional ones (or phrase) may be built in a matrix structure for use.
In addition, intensity of a feeling of an emotional search query may influence a ranking result. For example, when a user enters the search query "gloomy photo," the search result is sorted simply in descending order of scores of the "gloomy" item. However, provided that a user enters "little gloomy photo" including an adverb which represents intensity of a feeling, content having the "gloomy" score corresponding to "little" may be ranked more highly. The above idea may be implemented as follows. On condition that there is an adverb to express intensity of a feeling in a search query, a score of the adverb is set. For instance, the adverbs "very," "fairly," "somewhat," "rarely," "scarcely" and "never" are respectively set to 10, 7, 5, 3, 1 and 0. When a result of such a search is generated, pieces of content that have impression scores (approximately) corresponding to the score of the intensity of the feeling are ranked more highly. For example, suppose that the scores of the "gloomy" items in the impression score tables of web pages A and B are respectively 8 and 10; when a user enters a search query including the emotional words "fairly gloomy," the adverb "fairly" is set to 7 according to the above instance. Because the score of A is more approximate to the score 7 than that of B, A is ranked higher than B. The above-stated idea may be considered an analog search method.
In addition, when users have different feelings about the same content (particularly, opposite impressions are mixed in comments), such a condition may influence a ranking result. For example, suppose that users posted the ten "interesting" comments about video content A; the ten "interesting" comments and the three "boring" comments about video content B. When the query "interesting videos" is entered, 10 is added to the ranking score ofA and 7(= 10 - 3) that ofB.
Finally, in step S 108, the server makes the user's terminal display the adjusted search result (obtained through the step S 107). The displayed result may have a variety of representation. As one example, scores of the impression items about content are clearly visualized to a user. More specifically, a score of each impression item is represented in the form of a bar graph. Additionally, a trend of an impression score about content may be clearly displayed. More specifically, a change of an impression score about content can be displayed in a line chart. In addition, content and the impression score tables, or metadata thereof may be well structured so that they are easily accessed, read and browsed. More specifically, the data can be structured in the form of directories or a matrix so that it is displayed in a user's terminal.
In the case where there are only emotional words in a search query (e.g., "beauty" and "benignity"), step S 104 is omitted. Then, any piece of content whose an impression item corresponding to the query is checked may get a high rank.
Besides, apart from the above-stated method that uses database which is previously crawled, indexed and ranked for a search engine, it is possible that content, the related documents and the attached comments are serially scanned on demand and the result sorted by the ranking method is immediately generated.
Hereinafter, a system (See FIG. 9) which searches information of collective emotion based on comments about content will be explained. The system includes web servers 901 ; a main server for the system 910; a user's terminal 930; a database for an impression classification table 903 and a search database 904.
To be more particular, the main server 910 gets web documents with comments through the telecommunications network 902 from the web servers 901. The device 930 is used to enter a search query including emotional word(s). It is a terminal of a PC, a mobile phone, a PDA (Personal digital assistant) or any other device. It is linked to the main server 910 across the telecommunications network 902. A user gets a search result in response to a query including an emotional word using the terminal 930.
The main server 910 is managed by a search provider. In the present invention, the server stores the database for the impression classification table 903 and the search database 904; it controls and manages steps for searching information of collective information based on comments about content. The search provider sends a search result, which is sorted by a proper ranking, back to the terminal of the user who entered a query including an emotional word, as well as managing the main server 910. The main server 910 includes the following modules: a document collecting module 911, a comment extracting module 912, an emotional-word finding module 913, an impression-class looking-up module 914, an impression-item checking module 915, a database storing module 916, a data transferring module 917, a content finding module 918, a rank adjusting module 919 and a result handling module 920.
The module 911 collects documents to construct the search database from the web servers 901 by using a web robot or any other method. The module 912 separates and extracts comments from the documents collected by the module 911.
The module 913 finds, separates and extracts emotional word(s) in comments on content or in a search query including emotional word(s). The module 914 looks up an impression class to which the extracted emotional word(s) belong in the database for the impression classification table. The module 915 checks a matched item, in the impression score table, set by the module 914 or assigns a score to the item. The module finds out whether an impression item, which is corresponding to the impression class of the search query, is checked or a score is assigned to the item. The module 916 stores information about the content and the impression score table, or metadata thereof into the search database. The module 917 receives a search query from the user's terminal 930. The module
918 finds content relevant to the non-emotional word(s) in the search query, in the search database. In the content found by the module 918, if one or more of their impression items corresponding to an impression class set by the module 915 are checked, the checked pieces of content are considered highly relevant to the query. Thus the module 919 adjusts the
ranks of the found content. The module 920 makes the user's terminal 930 display the
search result adjusted by the module 919.
The main server 910 stores the database 903 (see FIG. 9). An impression
classification table (see FIG. 3), where the search provider has classified emotional words
and itemized them, is in the database 903. The main server 910 has the database 904 (see
FIG. 9) that stores information about content and the impression score tables, or metadata
thereof.
[Advantageous Effects ]
As described above, the information search method and system according to the
present invention produce the following effects.
First, a systematic and reliable search result can be obtained because the result is
provided through the objective and formulated ranking method based on collective emotion.
It is different from the conventional method in which the result is obtained by impressions
of a few administrators. Thus, a variety of costs (e.g., labor cost for the administrators) are
reduced. Second, content irrelevant to such a search query is not or scarcely displayed in top
rank because collective emotion is reflected, differently from the conventional method in
which only relies on text information related to content.
[Brief description of Drawings]
FIG. 1 is a flow chart illustrating process for finding information in response to a
search query including an emotional word;
FIG. 2 is a flow chart illustrating process for constructing the search database;
FIG. 3 presents an impression classification table stored in a database;
FIG. 4 is a view illustrating exemplary HTML files to link dance music content
uploaded into a web site by a web site manager or a normal user;
FIG. 5 is a view illustrating impressions and reviews, in comment sections, posted
by users who have visited the web site and appreciated content linked thereto;
FIG. 6 is a view illustrating an inverted index created by indexing the documents
that have been collected;
FIG. 7 is an exemplary view illustrating records comprising the URLs of content
and impressions about that which is stored in the search database; FIG. 8 is an exemplary view illustrating relationship between the records
comprising the URLs of content and impressions about that which is stored in the search
database AND items in the relevant list of information about documents/content in the
inverted index; and
FIG. 9 is a general view illustrating the system for searching information of
collective emotion based on comments about content.
[Best Mode]
Hereinafter, embodiments of the present invention will be described in detail with
reference to the accompanying drawings. In the entire description of the present invention,
the same drawing reference numerals are used for the same elements across various figures.
According to a first embodiment of the present invention, information responsive to a
search query including an emotional word is retrieved, as shown in a flow chart of FIG. 1.
According to a second embodiment of the invention, a search database is constructed, as
shown in a flow chart of FIG. 2.
First embodiment is as follows.
The embodiment for retrieving information responsive to a search query including
an emotional word will be explained in detail with reference to FIG. 1. The search database should be constructed in advance (SlOl in FIG. 1), which will be explained in detail in the second embodiment later.
When a user enters the search query "fun dance music" (S 102), the main server receives the query and then separates/extracts the emotional word "fun" and the non-emotional phrase "dance music" (S 103).
Next, the server finds a list of information about documents/content relevant to the index term "dance music" in the search database (S 104). In the list, an item to indicate webpage A and that to indicate B are stored in the order as shown in FIG. 6.
As shown in records 811 in FIG. 8, each record in 811, in the search database, includes an impression score table and content's URL (which is a key field). The server finds such a record (in 801) whose the content's URL field matches the content's URL field of the record related to webpage A/B (see 801 and 802 in FIG. 8).
And the server finds out which of impression classes the emotional word "fun" belongs to in the impression classification table (see FIG. 3) (S 105). As shown in the table, "fun" belongs to the item "merry/gloomy" and it has a positive score.
Next, the server examines whether a score is assigned to the "merry/gloomy" item of each impression score table of dance music A and B in the search database (S 106). The server finds out the numbers 0 and +3 are assigned to "merry/gloomy" item of A and B respectively in FIG. 7.
The server adjusts a ranking number given to each web page on the basis of the scores taken as above (S 107). In this case, the end ranking number of webpage A is 1 because 1 - 0 = 1 and that of B is -1 because 2 - (+3) = -1. As a result, differently from the query "dance music," the response to the query "fun dance music" reflects collective emotion. Thus, webpage B is ranked higher than A.
When a searcher enters the query "boring dance music," in the same way, the server finds out the numbers -2 and 0 are assigned to the "interesting/boring" item of dance music A and B, respectively. On condition that the extracted emotional word(s) is negative, the server reverses the sign of an impression score of the emotional word before subtracting each impression score from a ranking number. In this case, the end ranking number of webpage A is -1 because 1 - (-(-2)) = -1 and that of B is 2 because 2 - 0 = 2. Thus, webpage
A is ranked higher than B; the server makes the user's terminal display the search result (S 108).
Second embodiment is as follows.
The embodiment for constructing the search database will be explained in detail with reference to FIG. 2. A search provider previously creates a database for an impression classification table in which a variety of emotional words are classified as shown in FIG. 3. For example, "interesting" and "boring" are opposite to each other but distinguished from other feelings. Thus, one impression class item called "interesting/boring" is set. In addition, the words "tedious" and "boring" are classified into the impression class "interesting/boring." Accordingly, when the word "tedious" is included in the comment, a negative score is assigned to an item of the impression class "interesting/boring." In addition, the words "fun," "merry" and "cheerful" belong to the impression class "merry/gloomy" and a positive score is assigned to an item of the class. An administrator of a website or a normal user uploads two HTML files to link dance music content on the website as shown in FIG. 4. Content 401 and 402 are the sources of the two. The anchor text in content 401 contains "dance music A" and 402 "dance music B" to describe the content. Each of the two links refers to related content. The two are displayed, like web pages 403 and 404, to human users. Users visit the web site and appreciate the dance music linked to the web pages.
Then, the users post their impressions and opinions into the comment sections (FIG. 5). The impressions for dance music A were not good and two users wrote negative comments (501). On the contrary, the impressions for dance music B were good and three users wrote positive comments (502).
At this time, the server collects the web pages with the comments as shown in FIG.
5 (S201 in FIG. 2). The collected documents may be indexed and ranked. When constructing the inverted index 601 in FIG. 6, the server stores the index term "dance music" with the URLs of the web pages related thereto, the URLs of relevant content, etc into the search database 904. Additionally, ranking numbers are stored along with them. The rankings numbers may be computed according to the present invention, or may be done by other algorithms irrespective of the invention. In the embodiment, regardless of an emotional word, webpage A got the ranking number 1 and B the number 2 (the lower the ranking number is, the higher the rank is).
Merely if the search query "dance music" is issued, webpage A is ranked higher than B in its response.
The server analyzes impressions in the comments about dance music A and B; it classifies them. The result is stored in the search database. Each of the stored records includes content's URL field and items of impression scores of the content (701 in FIG. 7), where the content's URL field is the key of the record. Usually, a unique identification number is used as a document/content identifier. However, in the embodiment, content's URL is used as the identifier.
For that purpose, the server extracts comments from the collected documents
(S202) and finds emotional words in the extracted comments through the word stemming, etc (S203). Then, the server finds out which of the items in the impression classification table each of the emotional words belongs to (S204). Then, the server assigns scores to items, in the impression score tables, corresponding to the emotional words (S205).
More particularly, the server extracts two emotional words "boring" and "tedious" from the comments about dance music A. As shown in the impression classification table of FIG. 3, the two words belong to the impression item "interesting/boring." The server assigns a negative score, which the two words get, to the item. Thus, the score -2 is assigned to the item "interesting/boring" of the corresponding record (702 in FIG. 7).
On the other hand, the server extracts the three emotional words "fun," "merry" and "cheerful" from the comments about dance music B. As shown in the impression classification table, the three words belong to the impression item "merry/gloomy." The server assigns a positive score, which the words get, to the item. Thus, the score +3 is assigned to the item "merry/gloomy" of the corresponding record (703 in FIG. 7). As described above, the server constructs the search database which stores the information about the content and the impression score tables thereof (S206).
It should be understood by those of ordinary skill in the art that various replacements, modifications and changes in the form and details may be made therein without departing from the spirit and scope of the present invention as defined by the following claims. Therefore, it is to be appreciated that the above described embodiments are for purposes of illustration only and are not to be construed as limitations of the invention.

Claims

[CLAIMS] [Claim 1 ]
A method for searching information of collective emotion based on comments
about content, comprising:
constructing a search database in which impression score tables are stored (SlOl);
receiving a search query from a user (S 102);
separating and extracting emotional word(s) and non-emotional one(s) from the
transferred query (S 103);
finding out which of impression classes in an impression classification table the
extracted emotional word(s) belongs to (S 105);
determining whether an item, which matches the class, in each impression score
table of (the) content is checked or a score is assigned to the item (S 106);
adjusting a ranking result according to a predetermined method dependent on the
"checked" values or the scores of the items (S 107); and
making the user's terminal display the adjusted search result (S 108).
[Claim 2]
The method of claim 1, the step S 103 comprises: finding content relevant to the extracted non-emotional word(s) in the search
database after the step S 103 (S 104).
[Claim 3]
The method of claim 1, wherein the step SlOl comprising:
collecting documents to which comments are attached on the Internet (S201);
extracting comments from the collected documents (S202);
searching the extracted comments for emotional words (S203);
finding out which of impression classes in the impression classification table each
of the found emotional words of the content belongs to (S204);
checking corresponding items in the impression score tables or assigning scores to
them (S205); and
storing information about the content and the corresponding impression score
tables into the search database (S206).
[Claim 4] A system for searching information of collective emotion based on comments
about content, comprising a main server,
where the main server comprises: a database, for an impression classification table, to store an impression classification table in which names of impression classes and emotional words corresponding thereto have been classified; a search database to store information about content and the impression score tables thereof; a document collecting module to collect documents, which are necessary to construct the search database, from web servers that are linked to the main server across telecommunications networks; a comment extracting module to separate and extract comments from the collected documents; an emotional-word finding module to separate and extract emotional word(s) from comments about content or a search query including emotional word(s); an impression-class looking-up module to find out which of impression classes in an impression classification table the extracted emotional word(s) belongs to; an impression-item checking module to check an item, which matches the emotional word(s), in each impression score table of content or assign a score to the item , and to examine whether an item, which matches the class, in each impression score table of content is checked or a score is assigned to the item ; and a database storing module to store information about content and the impression
score tables thereof into the search database.
[Claim 5]
The system of claim 4, wherein the main server comprises:
a data transferring module to receive a search query from a user that entered it into
a terminal of a client across telecommunications networks;
a content finding module to find content relevant to non-emotional word(s)
included in the search query in the search database;
a rank adjusting module to adjust a ranking result according to the
"checked/unchecked" value or the score of the matched item in each impression score table
of the found content; and
a result handling module to make the user's terminal display the adjusted search
result.
PCT/KR2008/007228 2007-12-24 2008-12-05 Method and system for searching information of collective emotion based on comments about contents on internet WO2009082100A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US12/679,011 US20100262597A1 (en) 2007-12-24 2008-12-05 Method and system for searching information of collective emotion based on comments about contents on internet

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR1020070136565A KR100917784B1 (en) 2007-12-24 2007-12-24 Method and system for retrieving information of collective emotion based on comments about content
KR10-2007-0136565 2007-12-24

Publications (4)

Publication Number Publication Date
WO2009082100A2 WO2009082100A2 (en) 2009-07-02
WO2009082100A3 WO2009082100A3 (en) 2009-08-13
WO2009082100A4 WO2009082100A4 (en) 2009-10-29
WO2009082100A9 true WO2009082100A9 (en) 2010-03-25

Family

ID=40801655

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2008/007228 WO2009082100A2 (en) 2007-12-24 2008-12-05 Method and system for searching information of collective emotion based on comments about contents on internet

Country Status (3)

Country Link
US (1) US20100262597A1 (en)
KR (1) KR100917784B1 (en)
WO (1) WO2009082100A2 (en)

Families Citing this family (51)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9690790B2 (en) 2007-03-05 2017-06-27 Dell Software Inc. Method and apparatus for efficiently merging, storing and retrieving incremental data
US20100107075A1 (en) * 2008-10-17 2010-04-29 Louis Hawthorne System and method for content customization based on emotional state of the user
US20100100827A1 (en) * 2008-10-17 2010-04-22 Louis Hawthorne System and method for managing wisdom solicited from user community
US20100100826A1 (en) * 2008-10-17 2010-04-22 Louis Hawthorne System and method for content customization based on user profile
US20110113041A1 (en) * 2008-10-17 2011-05-12 Louis Hawthorne System and method for content identification and customization based on weighted recommendation scores
US20110016102A1 (en) * 2009-07-20 2011-01-20 Louis Hawthorne System and method for identifying and providing user-specific psychoactive content
KR101181073B1 (en) 2009-07-28 2012-09-07 현대자동차주식회사 Clutch Actuating Apparatus for Double Clutch Transmission
US11113299B2 (en) 2009-12-01 2021-09-07 Apple Inc. System and method for metadata transfer among search entities
US11036810B2 (en) * 2009-12-01 2021-06-15 Apple Inc. System and method for determining quality of cited objects in search results based on the influence of citing subjects
US20110154197A1 (en) * 2009-12-18 2011-06-23 Louis Hawthorne System and method for algorithmic movie generation based on audio/video synchronization
US8888497B2 (en) * 2010-03-12 2014-11-18 Yahoo! Inc. Emotional web
US9613139B2 (en) 2010-03-24 2017-04-04 Taykey Ltd. System and methods thereof for real-time monitoring of a sentiment trend with respect of a desired phrase
US9946775B2 (en) 2010-03-24 2018-04-17 Taykey Ltd. System and methods thereof for detection of user demographic information
US8782046B2 (en) 2010-03-24 2014-07-15 Taykey Ltd. System and methods for predicting future trends of term taxonomies usage
WO2011119410A2 (en) * 2010-03-24 2011-09-29 Taykey, Ltd. A system and methods thereof for mining web based user generated content for creation of term taxonomies
US8965835B2 (en) 2010-03-24 2015-02-24 Taykey Ltd. Method for analyzing sentiment trends based on term taxonomies of user generated content
US9183292B2 (en) 2010-03-24 2015-11-10 Taykey Ltd. System and methods thereof for real-time detection of an hidden connection between phrases
US10600073B2 (en) 2010-03-24 2020-03-24 Innovid Inc. System and method for tracking the performance of advertisements and predicting future behavior of the advertisement
US9323844B2 (en) 2010-06-11 2016-04-26 Doat Media Ltd. System and methods thereof for enhancing a user's search experience
US9069443B2 (en) 2010-06-11 2015-06-30 Doat Media Ltd. Method for dynamically displaying a personalized home screen on a user device
US10713312B2 (en) 2010-06-11 2020-07-14 Doat Media Ltd. System and method for context-launching of applications
US9338215B2 (en) 2011-03-14 2016-05-10 Slangwho, Inc. Search engine
US9858342B2 (en) 2011-03-28 2018-01-02 Doat Media Ltd. Method and system for searching for applications respective of a connectivity mode of a user device
US9152697B2 (en) 2011-07-13 2015-10-06 International Business Machines Corporation Real-time search of vertically partitioned, inverted indexes
KR101695014B1 (en) 2011-08-24 2017-01-10 한국전자통신연구원 Method for building emotional lexical information and apparatus for the same
KR101305535B1 (en) * 2011-08-26 2013-09-06 허순영 Recommendation system for moving picture
US8849826B2 (en) 2011-09-30 2014-09-30 Metavana, Inc. Sentiment analysis from social media content
WO2013059290A1 (en) * 2011-10-17 2013-04-25 Metavana, Inc. Sentiment and influence analysis of twitter tweets
US11599892B1 (en) 2011-11-14 2023-03-07 Economic Alchemy Inc. Methods and systems to extract signals from large and imperfect datasets
KR101672349B1 (en) * 2011-12-27 2016-11-07 한국전자통신연구원 File cloud service apparatus and method
US9401097B2 (en) * 2012-06-29 2016-07-26 Jong-Phil Kim Method and apparatus for providing emotion expression service using emotion expression identifier
KR101700820B1 (en) 2012-07-11 2017-02-01 한국전자통신연구원 Personalized content searching apparatus and method based on user's comment
CN103714063B (en) * 2012-09-28 2017-08-04 国际商业机器公司 Data analysing method and its system
CN104239331B (en) * 2013-06-19 2018-10-09 阿里巴巴集团控股有限公司 A kind of method and apparatus for realizing comment search engine sequence
US10007954B2 (en) * 2013-08-23 2018-06-26 International Business Machines Corporation Managing an initial post on a website
MY184612A (en) 2013-09-27 2021-04-08 Mimos Berhad A system and method for ranking recommendations
CN103559174B (en) * 2013-09-30 2016-03-09 东软集团股份有限公司 Semantic emotion classification characteristic value extraction and system
KR101465756B1 (en) * 2013-12-03 2014-12-03 주식회사 그리핀 Apparatus and method for analyzing emotion and method for recommending movice using the same
KR101794137B1 (en) * 2014-11-06 2017-11-06 아주대학교산학협력단 Data visualization method and system using reference semantic map
KR101602898B1 (en) 2014-11-07 2016-03-11 아주대학교산학협력단 Data visualization method and system using comment data for objects
EP3089053A4 (en) * 2015-01-30 2017-10-11 Ubic, Inc. Data evaluation system, data evaluation method, and data evaluation program
US10901945B2 (en) * 2016-01-05 2021-01-26 The grät Network, PBC Systems and methods concerning tracking models for digital interactions
KR101946022B1 (en) 2016-09-30 2019-02-08 에스케이플래닛 주식회사 Method and apparatus for analysis, display of replies
US10567844B2 (en) * 2017-02-24 2020-02-18 Facebook, Inc. Camera with reaction integration
US10545996B2 (en) * 2017-04-19 2020-01-28 Microsoft Technology Licensing, Llc Impression tagging system for locations
US10225621B1 (en) 2017-12-20 2019-03-05 Dish Network L.L.C. Eyes free entertainment
CN109033433B (en) * 2018-08-13 2020-09-29 中国地质大学(武汉) Comment data emotion classification method and system based on convolutional neural network
US11436292B2 (en) 2018-08-23 2022-09-06 Newsplug, Inc. Geographic location based feed
CN109299226A (en) * 2018-10-25 2019-02-01 北京奇艺世纪科技有限公司 A kind of data processing method and system
CN110399481B (en) * 2019-06-06 2022-04-12 深思考人工智能机器人科技(北京)有限公司 Method and device for screening emotional entity words
CN111538841B (en) * 2020-07-09 2020-10-16 华东交通大学 Comment emotion analysis method, device and system based on knowledge mutual distillation

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100254803B1 (en) * 1997-09-19 2000-05-01 윤덕용 System for korean fulltext information retrieval in computer work-station
US6473729B1 (en) * 1999-12-20 2002-10-29 Xerox Corporation Word phrase translation using a phrase index
KR20060093144A (en) * 2005-02-21 2006-08-24 주식회사 지엑스 A development of contents bank system using a human sensibility eliments
KR100697339B1 (en) * 2005-07-14 2007-03-20 (주)케이테크 Sensory based searching system for color image and System of design simulation using thereof
US8442972B2 (en) * 2006-10-11 2013-05-14 Collarity, Inc. Negative associations for search results ranking and refinement
US7930302B2 (en) * 2006-11-22 2011-04-19 Intuit Inc. Method and system for analyzing user-generated content

Also Published As

Publication number Publication date
WO2009082100A2 (en) 2009-07-02
WO2009082100A4 (en) 2009-10-29
WO2009082100A3 (en) 2009-08-13
US20100262597A1 (en) 2010-10-14
KR20090068803A (en) 2009-06-29
KR100917784B1 (en) 2009-09-21

Similar Documents

Publication Publication Date Title
US20100262597A1 (en) Method and system for searching information of collective emotion based on comments about contents on internet
US9857946B2 (en) System and method for evaluating sentiment
US8135669B2 (en) Information access with usage-driven metadata feedback
US20070250501A1 (en) Search result delivery engine
US20090055394A1 (en) Identifying key terms related to similar passages
US20080059453A1 (en) System and method for enhancing the result of a query
US20090070325A1 (en) Identifying Information Related to a Particular Entity from Electronic Sources
US20130246440A1 (en) Processing a content item with regard to an event and a location
US20070038608A1 (en) Computer search system for improved web page ranking and presentation
US20060155693A1 (en) Domain expert search
US20050114324A1 (en) System and method for improved searching on the internet or similar networks and especially improved MetaNews and/or improved automatically generated newspapers
US20150261773A1 (en) System and Method for Automatic Generation of Information-Rich Content from Multiple Microblogs, Each Microblog Containing Only Sparse Information
CN109614504A (en) A kind of management system and method for internet electronic book
US20070271228A1 (en) Documentary search procedure in a distributed system
Colomb Information spaces: the architecture of cyberspace
JP6130270B2 (en) Comment list public server, program and method for sorting and specifying comment sets corresponding to media contents
WO2008130482A1 (en) Systems and methods for personalizing a newspaper
Nguyen et al. NowAndThen: a social network-based photo recommendation tool supporting reminiscence
Vidulin et al. Multi-label approaches to web genre identification
JP6653169B2 (en) Keyword extraction device, content generation system, keyword extraction method, and program
Gruhl et al. The web beyond popularity: a really simple system for web scale rss
Lappas et al. Interactive recommendations in social endorsement networks
Oudshoff et al. Knowledge discovery in virtual community texts: Clustering virtual communities
JP2005025418A (en) Question answering device, question answering method, and program
Seo Search using social media structures

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 08864512

Country of ref document: EP

Kind code of ref document: A2

WWE Wipo information: entry into national phase

Ref document number: 12679011

Country of ref document: US

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 08864512

Country of ref document: EP

Kind code of ref document: A2