WO2006093789A3 - Semantic document profiling - Google Patents
Semantic document profiling
- Publication number
- WO2006093789A3 (application PCT/US2006/006429)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- term
- document
- semantic
- profiling
- terms
- Prior art date
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
- G06F16/3344—Query execution using natural language analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
Abstract
A method of semantic profiling of documents comprises: receiving a document to be profiled, the document comprising a plurality of terms; for each of at least a portion of the plurality of terms in the document, determining a part of speech and a grammatical function of the term, obtaining senses of the term, selecting a sense as a most likely meaning of the term, and calculating an information value of the term; and generating a semantic profile of the document comprising at least some of the calculated information values.
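The steps named in the abstract can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the toy lexicon `POS`, the toy sense inventory `SENSES`, the position-based rule for grammatical function, the cue-word overlap used to pick a sense (a simplified heuristic in the spirit of Lesk), and the use of self-information, -log2(relative frequency in the document), as the "information value". The abstract does not say how any of these are computed.

```python
import math
from collections import Counter
from dataclasses import dataclass

# Hypothetical toy lexicon: term -> part of speech (assumption, not from the patent).
POS = {"the": "DET", "a": "DET", "for": "ADP", "bank": "NOUN",
       "loan": "NOUN", "money": "NOUN", "approved": "VERB"}

# Hypothetical toy sense inventory: term -> {sense label: cue words}.
SENSES = {
    "bank": {"bank/finance": {"money", "loan", "deposit"},
             "bank/river": {"river", "water", "shore"}},
    "loan": {"loan/credit": {"money", "bank"}},
}


@dataclass
class TermProfile:
    term: str
    pos: str
    function: str
    sense: str
    information_value: float


def tokenize(text):
    """Lowercase whitespace tokens with surrounding punctuation stripped."""
    words = (w.strip(".,;:!?").lower() for w in text.split())
    return [w for w in words if w]


def grammatical_function(index, tags):
    """Crude positional rule: nouns before the first verb are subjects."""
    if tags[index] == "VERB":
        return "predicate"
    if tags[index] == "NOUN":
        return "object" if "VERB" in tags[:index] else "subject"
    return "other"


def select_sense(term, context):
    """Pick the sense whose cue words overlap most with the document terms."""
    candidates = SENSES.get(term)
    if not candidates:
        return term  # no inventory entry: fall back to the surface form
    return max(candidates, key=lambda s: len(candidates[s] & context))


def profile_document(text):
    """Return {term: TermProfile} for the nouns and verbs of the document."""
    tokens = tokenize(text)
    tags = [POS.get(t, "X") for t in tokens]
    counts = Counter(tokens)
    context = set(tokens)
    profile = {}
    for i, term in enumerate(tokens):
        if tags[i] not in ("NOUN", "VERB"):
            continue  # profile only a portion of the terms
        profile[term] = TermProfile(
            term=term,
            pos=tags[i],
            function=grammatical_function(i, tags),
            sense=select_sense(term, context),
            information_value=-math.log2(counts[term] / len(tokens)),
        )
    return profile


if __name__ == "__main__":
    for entry in profile_document("The bank approved a loan for the money.").values():
        print(entry)
```

The resulting dictionary plays the role of the semantic profile: each profiled term carries its selected sense and an information value. A real system would presumably replace the toy tables with a tagger, a parser, and a full lexical resource.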
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US65608805P | 2005-02-25 | 2005-02-25 | |
US60/656,088 | 2005-02-25 | | |
US11/232,898 US20070073678A1 (en) | 2005-09-23 | 2005-09-23 | Semantic document profiling |
US11/232,898 | 2005-09-23 | | |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2006093789A2 (en) | 2006-09-08 |
WO2006093789A3 (en) | 2007-12-06 |
Family
ID=36941644
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/US2006/006429 WO2006093789A2 (en) | 2005-02-25 | 2006-02-24 | Semantic document profiling |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2006093789A2 (en) |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5873056A (en) * | 1993-10-12 | 1999-02-16 | The Syracuse University | Natural language processing system for semantic vector representation which accounts for lexical ambiguity |
US20050027512A1 (en) * | 2000-07-20 | 2005-02-03 | Microsoft Corporation | Ranking parser for a natural language processing system |
- 2006-02-24: WO application PCT/US2006/006429 (WO2006093789A2), active, Application Filing
Also Published As
Publication number | Publication date |
---|---|
WO2006093789A2 (en) | 2006-09-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2003256313A1 (en) | | A method for comparing a transcribed text file with a previously created file |
BRPI0418449A (en) | | method for noise suppression of a speech signal, speech encoder, and computer program |
WO2010092423A8 (en) | | Music profiling |
WO2007070622A3 (en) | | Detecting and rejecting annoying documents |
WO2005040971A3 (en) | | System and model for performance value based collaborative relationships |
WO2005025116A3 (en) | | Management of digital content licenses |
HUP0301289A3 (en) | | Method and system in a computer environment, computer-implemented method and computer-readable medium |
WO2007041370A3 (en) | | Using speech recognition to determine advertisement relevant to audio content |
WO2007035912A3 (en) | | Document processing |
WO2006017493A3 (en) | | Approach for creating a tag or attribute in a markup language document |
WO2005045623A3 (en) | | Method and system for serving advertisements |
WO2006083987A3 (en) | | Collaborative web page authoring |
WO2006023877A3 (en) | | Methods, systems, and apparatuses for extended enterprise commerce |
WO2008088652A3 (en) | | Method and system for generating a predictive analysis of the performance of peer reviews |
WO2004034232A3 (en) | | Method and system for selecting between alternatives |
WO2007048607A3 (en) | | Automatic, computer-based similarity calculation system for quantifying the similarity of text expressions |
WO2007143223A3 (en) | | System and method for entity based information categorization |
WO2006008733A8 (en) | | A method for determining near duplicate data objects |
NO20053218D0 (en) | | Methods for Determining Formation and Borehole Parameters Using Fresnel Volume Tomography |
EP1717713A3 (en) | | Automated document localization and layout method |
WO2006002179A3 (en) | | Evaluating the relevance of documents and systems and methods therefor |
BRPI0918767A2 (en) | | method for determining water saturation in a subsurface formation, and well profiling instrument |
TW200634624A (en) | | System, apparatus and method of selecting graphical component types at runtime |
WO2006039454A3 (en) | | Technical specification editor |
NO20042756L (en) | | Francis turbine |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | NENP | Non-entry into the national phase | Ref country code: DE |
 | 121 | EP: the EPO has been informed by WIPO that EP was designated in this application | Ref document number: 06735909; Country of ref document: EP; Kind code of ref document: A2 |
 | 122 | EP: PCT application non-entry in European phase | Ref document number: 06735909; Country of ref document: EP; Kind code of ref document: A2 |