WO2006093789A3 - Semantic document profiling - Google Patents

Semantic document profiling Download PDF

Info

Publication number
WO2006093789A3
WO2006093789A3 PCT/US2006/006429 US2006006429W WO2006093789A3 WO 2006093789 A3 WO2006093789 A3 WO 2006093789A3 US 2006006429 W US2006006429 W US 2006006429W WO 2006093789 A3 WO2006093789 A3 WO 2006093789A3
Authority
WO
WIPO (PCT)
Prior art keywords
term
document
semantic
profiling
terms
Prior art date
Application number
PCT/US2006/006429
Other languages
French (fr)
Other versions
WO2006093789A2 (en
Inventor
Bernard Scott
Maksim Timofeyev
D Armond Speers
Original Assignee
Applied Linguistics Llc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US11/232,898 external-priority patent/US20070073678A1/en
Application filed by Applied Linguistics Llc filed Critical Applied Linguistics Llc
Publication of WO2006093789A2 publication Critical patent/WO2006093789A2/en
Publication of WO2006093789A3 publication Critical patent/WO2006093789A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3344Query execution using natural language analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis

Abstract

A method of semantic profiling of documents comprises receiving a document to be profiled, the document comprising a plurality of terms, for each of at least a portion of the plurality of terms in the document determining a part of speech and a grammatical function of the term, obtaining senses of the term, selecting a sense as a most likely meaning of the term, and calculating an information value of the term, and generating a semantic profile of the document comprising at least some of the calculated information values.
PCT/US2006/006429 2005-02-25 2006-02-24 Semantic document profiling WO2006093789A2 (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US65608805P 2005-02-25 2005-02-25
US60/656,088 2005-02-25
US11/232,898 US20070073678A1 (en) 2005-09-23 2005-09-23 Semantic document profiling
US11/232,898 2005-09-23

Publications (2)

Publication Number Publication Date
WO2006093789A2 WO2006093789A2 (en) 2006-09-08
WO2006093789A3 true WO2006093789A3 (en) 2007-12-06

Family

ID=36941644

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2006/006429 WO2006093789A2 (en) 2005-02-25 2006-02-24 Semantic document profiling

Country Status (1)

Country Link
WO (1) WO2006093789A2 (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5873056A (en) * 1993-10-12 1999-02-16 The Syracuse University Natural language processing system for semantic vector representation which accounts for lexical ambiguity
US20050027512A1 (en) * 2000-07-20 2005-02-03 Microsoft Corporation Ranking parser for a natural language processing system

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5873056A (en) * 1993-10-12 1999-02-16 The Syracuse University Natural language processing system for semantic vector representation which accounts for lexical ambiguity
US20050027512A1 (en) * 2000-07-20 2005-02-03 Microsoft Corporation Ranking parser for a natural language processing system

Also Published As

Publication number Publication date
WO2006093789A2 (en) 2006-09-08

Similar Documents

Publication Publication Date Title
AU2003256313A1 (en) A method for comparing a transcribed text file with a previously created file
BRPI0418449A (en) method for noise suppression of a speech signal, speech encoder, and, computer program
WO2010092423A8 (en) Music profiling
WO2007070622A3 (en) Detecting and rejecting annoying documents
WO2005040971A3 (en) System and model for performance value based collaborative relationships
WO2005025116A3 (en) Management of digital content licenses
HUP0301289A3 (en) Method and system in a computer environment, computer-implemented method and computer-readable medium
WO2007041370A3 (en) Using speech recognition to determine advertisement relevant to audio content
WO2007035912A3 (en) Document processing
WO2006017493A3 (en) Approach for creating a tag or attribute in a markup language document
WO2005045623A3 (en) Method and system for serving advertisements
WO2006083987A3 (en) Collaborative web page authoring
WO2006023877A3 (en) Methods, systems, and apparatuses for extended enterprise commerce
WO2008088652A3 (en) Method and system for generating a predictive analysis of the performance of peer reviews
WO2004034232A3 (en) Method and system for selecting between alternatives
WO2007048607A3 (en) Automatic, computer-based similarity calculation system for quantifying the similarity of text expressions
WO2007143223A3 (en) System and method for entity based information categorization
WO2006008733A8 (en) A method for determining near duplicate data objects
NO20053218D0 (en) Methods for Determining Formation and Borehole Parameters Using Fresnel Volume Tomography.
EP1717713A3 (en) Automated document localization and layout method
WO2006002179A3 (en) Evaluating the relevance of documents and systems and methods therefor
BRPI0918767A2 (en) method for determining water saturation in a subsurface formation, and well profiling instrument.
TW200634624A (en) System, apparatus and method of selecting graphical component types at runtime
WO2006039454A3 (en) Technical specification editor
NO20042756L (en) Francis turbine.

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 06735909

Country of ref document: EP

Kind code of ref document: A2

122 Ep: pct application non-entry in european phase

Ref document number: 06735909

Country of ref document: EP

Kind code of ref document: A2