CA2366545A1 - System and method for generating a taxonomy from a plurality of documents - Google Patents
System and method for generating a taxonomy from a plurality of documents Download PDFInfo
- Publication number
- CA2366545A1 CA2366545A1 CA002366545A CA2366545A CA2366545A1 CA 2366545 A1 CA2366545 A1 CA 2366545A1 CA 002366545 A CA002366545 A CA 002366545A CA 2366545 A CA2366545 A CA 2366545A CA 2366545 A1 CA2366545 A1 CA 2366545A1
- Authority
- CA
- Canada
- Prior art keywords
- taxonomy
- documents
- generating
- classifications
- clusters
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/36—Creation of semantic tools, e.g. ontology or thesauri
- G06F16/367—Ontology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
- G06F16/355—Class or cluster creation or modification
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/912—Applications of a database
- Y10S707/917—Text
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
- Y10S707/99934—Query formulation, input preparation, or translation
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
- Y10S707/99935—Query augmenting and refining, e.g. inexact access
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99941—Database schema or data structure
- Y10S707/99942—Manipulating data structure, e.g. compression, compaction, compilation
Abstract
A system and method for generating a taxonomy (30) is provided in which the taxonomy is generated based on clusters of phrases and a topical library (52). The taxonomy permits a user of a text processing system to rapidly search through a database (18) and find relevant documents since the classifications in the taxonomy are narrow enough to limit the number of documents classified in each classification.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/289,174 US6665681B1 (en) | 1999-04-09 | 1999-04-09 | System and method for generating a taxonomy from a plurality of documents |
US09/289,174 | 1999-04-09 | ||
PCT/US2000/009471 WO2000062203A1 (en) | 1999-04-09 | 2000-04-06 | System and method for generating a taxonomy from a plurality of documents |
Publications (2)
Publication Number | Publication Date |
---|---|
CA2366545A1 true CA2366545A1 (en) | 2000-10-19 |
CA2366545C CA2366545C (en) | 2005-12-20 |
Family
ID=23110369
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002366545A Expired - Fee Related CA2366545C (en) | 1999-04-09 | 2000-04-06 | System and method for generating a taxonomy from a plurality of documents |
Country Status (7)
Country | Link |
---|---|
US (2) | US6665681B1 (en) |
EP (1) | EP1208464A4 (en) |
JP (1) | JP2002541590A (en) |
AU (1) | AU4221200A (en) |
CA (1) | CA2366545C (en) |
HK (1) | HK1047174A1 (en) |
WO (1) | WO2000062203A1 (en) |
Families Citing this family (95)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6665681B1 (en) * | 1999-04-09 | 2003-12-16 | Entrieva, Inc. | System and method for generating a taxonomy from a plurality of documents |
AU2001240061A1 (en) | 2000-03-09 | 2001-09-17 | The Web Access, Inc. | Method and apparatus for organizing data by overlaying a searchable database with a directory tree structure |
US6711558B1 (en) | 2000-04-07 | 2004-03-23 | Washington University | Associative database scanning and information retrieval |
US20020049705A1 (en) * | 2000-04-19 | 2002-04-25 | E-Base Ltd. | Method for creating content oriented databases and content files |
AUPR033800A0 (en) * | 2000-09-25 | 2000-10-19 | Telstra R & D Management Pty Ltd | A document categorisation system |
US7191252B2 (en) | 2000-11-13 | 2007-03-13 | Digital Doors, Inc. | Data security system and method adjunct to e-mail, browser or telecom program |
US8458214B1 (en) * | 2000-11-14 | 2013-06-04 | Ebay Inc. | Taxonomy-based database partitioning |
US7305416B2 (en) * | 2000-12-18 | 2007-12-04 | Hewlett-Packard Development Company, L.P. | Network assembly and method for inserting an identification code |
US20070156665A1 (en) * | 2001-12-05 | 2007-07-05 | Janusz Wnek | Taxonomy discovery |
AUPR958901A0 (en) | 2001-12-18 | 2002-01-24 | Telstra New Wave Pty Ltd | Information resource taxonomy |
US7243092B2 (en) * | 2001-12-28 | 2007-07-10 | Sap Ag | Taxonomy generation for electronic documents |
US6996558B2 (en) | 2002-02-26 | 2006-02-07 | International Business Machines Corporation | Application portability and extensibility through database schema and query abstraction |
EP1487874A4 (en) * | 2002-03-01 | 2007-08-29 | Protemix Discovery Ltd | Falp proteins |
US7650327B2 (en) * | 2002-03-01 | 2010-01-19 | Marine Biological Laboratory | Managing taxonomic information |
US7567953B2 (en) | 2002-03-01 | 2009-07-28 | Business Objects Americas | System and method for retrieving and organizing information from disparate computer network information sources |
US7673234B2 (en) * | 2002-03-11 | 2010-03-02 | The Boeing Company | Knowledge management using text classification |
US7266553B1 (en) | 2002-07-01 | 2007-09-04 | Microsoft Corporation | Content data indexing |
US8335779B2 (en) | 2002-08-16 | 2012-12-18 | Gamroe Applications, Llc | Method and apparatus for gathering, categorizing and parameterizing data |
US7231384B2 (en) * | 2002-10-25 | 2007-06-12 | Sap Aktiengesellschaft | Navigation tool for exploring a knowledge base |
US7047236B2 (en) * | 2002-12-31 | 2006-05-16 | International Business Machines Corporation | Method for automatic deduction of rules for matching content to categories |
US9026901B2 (en) * | 2003-06-20 | 2015-05-05 | International Business Machines Corporation | Viewing annotations across multiple applications |
US8321470B2 (en) * | 2003-06-20 | 2012-11-27 | International Business Machines Corporation | Heterogeneous multi-level extendable indexing for general purpose annotation systems |
GB0315191D0 (en) * | 2003-06-28 | 2003-08-06 | Ibm | Methods, apparatus and computer programs for visualization and management of data organisation within a data processing system |
GB2403636A (en) * | 2003-07-02 | 2005-01-05 | Sony Uk Ltd | Information retrieval using an array of nodes |
US20050278362A1 (en) * | 2003-08-12 | 2005-12-15 | Maren Alianna J | Knowledge discovery system |
US7333997B2 (en) * | 2003-08-12 | 2008-02-19 | Viziant Corporation | Knowledge discovery method with utility functions and feedback loops |
US7756750B2 (en) | 2003-09-02 | 2010-07-13 | Vinimaya, Inc. | Method and system for providing online procurement between a buyer and suppliers over a network |
US7870152B2 (en) * | 2003-10-22 | 2011-01-11 | International Business Machines Corporation | Attaching and displaying annotations to changing data views |
US7617196B2 (en) | 2003-10-22 | 2009-11-10 | International Business Machines Corporation | Context-sensitive term expansion with multiple levels of expansion |
US20050144177A1 (en) * | 2003-11-26 | 2005-06-30 | Hodes Alan S. | Patent analysis and formulation using ontologies |
US20050234738A1 (en) * | 2003-11-26 | 2005-10-20 | Hodes Alan S | Competitive product intelligence system and method, including patent analysis and formulation using one or more ontologies |
US7900133B2 (en) * | 2003-12-09 | 2011-03-01 | International Business Machines Corporation | Annotation structure type determination |
US9288000B2 (en) | 2003-12-17 | 2016-03-15 | International Business Machines Corporation | Monitoring a communication and retrieving information relevant to the communication |
US7243099B2 (en) * | 2003-12-23 | 2007-07-10 | Proclarity Corporation | Computer-implemented method, system, apparatus for generating user's insight selection by showing an indication of popularity, displaying one or more materialized insight associated with specified item class within the database that potentially match the search |
US7870046B2 (en) * | 2004-03-04 | 2011-01-11 | Cae Solutions Corporation | System, apparatus and method for standardized financial reporting |
US8055553B1 (en) | 2006-01-19 | 2011-11-08 | Verizon Laboratories Inc. | Dynamic comparison text functionality |
US7487471B2 (en) * | 2004-07-23 | 2009-02-03 | Sap Ag | User interface for conflict resolution management |
US7533074B2 (en) * | 2004-07-23 | 2009-05-12 | Sap Ag | Modifiable knowledge base in a mobile device |
US7853574B2 (en) * | 2004-08-26 | 2010-12-14 | International Business Machines Corporation | Method of generating a context-inferenced search query and of sorting a result of the query |
US7584161B2 (en) * | 2004-09-15 | 2009-09-01 | Contextware, Inc. | Software system for managing information in context |
US20080059416A1 (en) * | 2004-09-15 | 2008-03-06 | Forbes David I | Software system for rules-based searching of data |
US8051096B1 (en) * | 2004-09-30 | 2011-11-01 | Google Inc. | Methods and systems for augmenting a token lexicon |
US7389282B2 (en) * | 2004-11-02 | 2008-06-17 | Viziant Corporation | System and method for predictive analysis and predictive analysis markup language |
CA2500573A1 (en) * | 2005-03-14 | 2006-09-14 | Oculus Info Inc. | Advances in nspace - system and method for information analysis |
US7634406B2 (en) * | 2004-12-10 | 2009-12-15 | Microsoft Corporation | System and method for identifying semantic intent from acoustic information |
JP2008537225A (en) * | 2005-04-11 | 2008-09-11 | テキストディガー,インコーポレイテッド | Search system and method for queries |
WO2006110853A2 (en) * | 2005-04-12 | 2006-10-19 | Maren Alianna J | System and method for evidence accumulation and hypothesis generation |
CN101305350A (en) * | 2005-06-09 | 2008-11-12 | 惠而浦公司 | Software architecture system and method for communication with, and management of, at least one component within a household appliance |
US8024338B2 (en) * | 2005-08-31 | 2011-09-20 | Brei James E | Systems, methods, and interfaces for reducing executions of overly broad user queries |
US20070136335A1 (en) * | 2005-12-09 | 2007-06-14 | Robert Dionne | Method and system for multiple independent extensions of a concept taxonomy via description logic classification |
US20070174255A1 (en) * | 2005-12-22 | 2007-07-26 | Entrieva, Inc. | Analyzing content to determine context and serving relevant content based on the context |
US8694530B2 (en) | 2006-01-03 | 2014-04-08 | Textdigger, Inc. | Search system with query refinement and search method |
US8379841B2 (en) | 2006-03-23 | 2013-02-19 | Exegy Incorporated | Method and system for high throughput blockwise independent encryption/decryption |
US8019754B2 (en) * | 2006-04-03 | 2011-09-13 | Needlebot Incorporated | Method of searching text to find relevant content |
WO2007114932A2 (en) | 2006-04-04 | 2007-10-11 | Textdigger, Inc. | Search system and method with text function tagging |
US20070276676A1 (en) * | 2006-05-23 | 2007-11-29 | Christopher Hoenig | Social information system |
US7519619B2 (en) * | 2006-08-21 | 2009-04-14 | Microsoft Corporation | Facilitating document classification using branch associations |
US7660793B2 (en) | 2006-11-13 | 2010-02-09 | Exegy Incorporated | Method and system for high performance integration, processing and searching of structured and unstructured data using coprocessors |
US8326819B2 (en) | 2006-11-13 | 2012-12-04 | Exegy Incorporated | Method and system for high performance data metatagging and data indexing using coprocessors |
KR100836878B1 (en) | 2006-11-29 | 2008-06-11 | 한국과학기술정보연구원 | Apparatus and method for allocation of subject or field in information search system |
US8423565B2 (en) * | 2006-12-21 | 2013-04-16 | Digital Doors, Inc. | Information life cycle search engine and method |
US8468244B2 (en) | 2007-01-05 | 2013-06-18 | Digital Doors, Inc. | Digital information infrastructure and method for security designated data and with granular data stores |
US8732197B2 (en) * | 2007-02-02 | 2014-05-20 | Musgrove Technology Enterprises Llc (Mte) | Method and apparatus for aligning multiple taxonomies |
US8280877B2 (en) * | 2007-02-22 | 2012-10-02 | Microsoft Corporation | Diverse topic phrase extraction |
US20080243823A1 (en) * | 2007-03-28 | 2008-10-02 | Elumindata, Inc. | System and method for automatically generating information within an eletronic document |
US7792838B2 (en) * | 2007-03-29 | 2010-09-07 | International Business Machines Corporation | Information-theory based measure of similarity between instances in ontology |
US8271476B2 (en) * | 2007-03-30 | 2012-09-18 | Stuart Donnelly | Method of searching text to find user community changes of interest and drug side effect upsurges, and presenting advertisements to users |
US8275773B2 (en) * | 2007-03-30 | 2012-09-25 | Stuart Donnelly | Method of searching text to find relevant content |
EP2186250B1 (en) | 2007-08-31 | 2019-03-27 | IP Reservoir, LLC | Method and apparatus for hardware-accelerated encryption/decryption |
US9081852B2 (en) * | 2007-10-05 | 2015-07-14 | Fujitsu Limited | Recommending terms to specify ontology space |
US8280892B2 (en) * | 2007-10-05 | 2012-10-02 | Fujitsu Limited | Selecting tags for a document by analyzing paragraphs of the document |
WO2009059297A1 (en) * | 2007-11-01 | 2009-05-07 | Textdigger, Inc. | Method and apparatus for automated tag generation for digital content |
US10733223B2 (en) * | 2008-01-08 | 2020-08-04 | International Business Machines Corporation | Term-driven records file plan and thesaurus design |
US9189478B2 (en) * | 2008-04-03 | 2015-11-17 | Elumindata, Inc. | System and method for collecting data from an electronic document and storing the data in a dynamically organized data structure |
KR100990292B1 (en) | 2008-06-11 | 2010-10-26 | 서강대학교산학협력단 | Method for making tag template, registering tag and searching contents according to OntoSonomy |
US8176042B2 (en) * | 2008-07-22 | 2012-05-08 | Elumindata, Inc. | System and method for automatically linking data sources for providing data related to a query |
US9607324B1 (en) | 2009-01-23 | 2017-03-28 | Zakta, LLC | Topical trust network |
US10007729B1 (en) | 2009-01-23 | 2018-06-26 | Zakta, LLC | Collaboratively finding, organizing and/or accessing information |
US10191982B1 (en) | 2009-01-23 | 2019-01-29 | Zakata, LLC | Topical search portal |
US20100211621A1 (en) * | 2009-02-19 | 2010-08-19 | Yahoo! Inc. | Web-based organization of online advertising content |
WO2010135375A1 (en) | 2009-05-20 | 2010-11-25 | Hotgrinds, Inc. | Semiotic square search and/or sentiment analysis system and method |
US8954893B2 (en) * | 2009-11-06 | 2015-02-10 | Hewlett-Packard Development Company, L.P. | Visually representing a hierarchy of category nodes |
US10068266B2 (en) | 2010-12-02 | 2018-09-04 | Vinimaya Inc. | Methods and systems to maintain, check, report, and audit contract and historical pricing in electronic procurement |
US8577823B1 (en) | 2011-06-01 | 2013-11-05 | Omar M. A. Gadir | Taxonomy system for enterprise data management and analysis |
US10366117B2 (en) | 2011-12-16 | 2019-07-30 | Sas Institute Inc. | Computer-implemented systems and methods for taxonomy development |
US9116985B2 (en) * | 2011-12-16 | 2015-08-25 | Sas Institute Inc. | Computer-implemented systems and methods for taxonomy development |
US20140108006A1 (en) * | 2012-09-07 | 2014-04-17 | Grail, Inc. | System and method for analyzing and mapping semiotic relationships to enhance content recommendations |
US11144994B1 (en) | 2014-08-18 | 2021-10-12 | Street Diligence, Inc. | Computer-implemented apparatus and method for providing information concerning a financial instrument |
US10474702B1 (en) | 2014-08-18 | 2019-11-12 | Street Diligence, Inc. | Computer-implemented apparatus and method for providing information concerning a financial instrument |
US11093706B2 (en) | 2016-03-25 | 2021-08-17 | Raftr, Inc. | Protagonist narrative balance computer implemented analysis of narrative data |
US10467277B2 (en) | 2016-03-25 | 2019-11-05 | Raftr, Inc. | Computer implemented detection of semiotic similarity between sets of narrative data |
US9842100B2 (en) | 2016-03-25 | 2017-12-12 | TripleDip, LLC | Functional ontology machine-based narrative interpreter |
US10643178B1 (en) | 2017-06-16 | 2020-05-05 | Coupa Software Incorporated | Asynchronous real-time procurement system |
US20200341977A1 (en) * | 2019-04-25 | 2020-10-29 | Mycelebs Co., Ltd. | Method and apparatus for managing attribute language |
WO2022185538A1 (en) * | 2021-03-05 | 2022-09-09 | 日本電気株式会社 | Information processing device, information processing method, and program |
Family Cites Families (46)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4370707A (en) * | 1971-08-03 | 1983-01-25 | Computer Service, Inc. | Computer system for generating architectural specifications and project control instructions |
US5157783A (en) | 1988-02-26 | 1992-10-20 | Wang Laboratories, Inc. | Data base system which maintains project query list, desktop list and status of multiple ongoing research projects |
US5146552A (en) * | 1990-02-28 | 1992-09-08 | International Business Machines Corporation | Method for associating annotation with electronically published material |
US5257185A (en) * | 1990-05-21 | 1993-10-26 | Ann W. Farley | Interactive, cross-referenced knowledge system |
US5325298A (en) | 1990-11-07 | 1994-06-28 | Hnc, Inc. | Methods for generating or revising context vectors for a plurality of word stems |
US5265065A (en) | 1991-10-08 | 1993-11-23 | West Publishing Company | Method and apparatus for information retrieval from a database by replacing domain specific stemmed phases in a natural language to create a search query |
US5483650A (en) * | 1991-11-12 | 1996-01-09 | Xerox Corporation | Method of constant interaction-time clustering applied to document browsing |
US5371807A (en) * | 1992-03-20 | 1994-12-06 | Digital Equipment Corporation | Method and apparatus for text classification |
US5517783A (en) * | 1994-02-14 | 1996-05-21 | Edgar; Dwight A. | Lure container |
US5655116A (en) * | 1994-02-28 | 1997-08-05 | Lucent Technologies Inc. | Apparatus and methods for retrieving information |
US5991709A (en) * | 1994-07-08 | 1999-11-23 | Schoen; Neil Charles | Document automated classification/declassification system |
US5694594A (en) | 1994-11-14 | 1997-12-02 | Chang; Daniel | System for linking hypermedia data objects in accordance with associations of source and destination data objects and similarity threshold without using keywords or link-difining terms |
US5625767A (en) * | 1995-03-13 | 1997-04-29 | Bartell; Brian | Method and system for two-dimensional visualization of an information taxonomy and of text documents based on topical content of the documents |
US5708825A (en) | 1995-05-26 | 1998-01-13 | Iconovex Corporation | Automatic summary page creation and hyperlink generation |
US5768580A (en) * | 1995-05-31 | 1998-06-16 | Oracle Corporation | Methods and apparatus for dynamic classification of discourse |
US5708822A (en) * | 1995-05-31 | 1998-01-13 | Oracle Corporation | Methods and apparatus for thematic parsing of discourse |
JPH0969101A (en) * | 1995-08-31 | 1997-03-11 | Hitachi Ltd | Method and device for generating structured document |
US5826025A (en) | 1995-09-08 | 1998-10-20 | Sun Microsystems, Inc. | System for annotation overlay proxy configured to retrieve associated overlays associated with a document request from annotation directory created from list of overlay groups |
US5819260A (en) * | 1996-01-22 | 1998-10-06 | Lexis-Nexis | Phrase recognition method and apparatus |
JP3643470B2 (en) * | 1997-09-05 | 2005-04-27 | 株式会社日立製作所 | Document search system and document search support method |
US5832499A (en) | 1996-07-10 | 1998-11-03 | Survivors Of The Shoah Visual History Foundation | Digital library system |
US5832495A (en) | 1996-07-08 | 1998-11-03 | Survivors Of The Shoah Visual History Foundation | Method and apparatus for cataloguing multimedia data |
US5920854A (en) * | 1996-08-14 | 1999-07-06 | Infoseek Corporation | Real-time document collection search engine with phrase indexing |
US6502191B1 (en) | 1997-02-14 | 2002-12-31 | Tumbleweed Communications Corp. | Method and system for binary data firewall delivery |
JP3655714B2 (en) * | 1996-11-15 | 2005-06-02 | 株式会社ニューズウオッチ | Information filtering apparatus and recording medium |
JP3579204B2 (en) * | 1997-01-17 | 2004-10-20 | 富士通株式会社 | Document summarizing apparatus and method |
US6415319B1 (en) | 1997-02-07 | 2002-07-02 | Sun Microsystems, Inc. | Intelligent network browser using incremental conceptual indexer |
US5963965A (en) * | 1997-02-18 | 1999-10-05 | Semio Corporation | Text processing and retrieval system and method |
US6023697A (en) * | 1997-02-24 | 2000-02-08 | Gte Internetworking Incorporated | Systems and methods for providing user assistance in retrieving data from a relational database |
US5819258A (en) * | 1997-03-07 | 1998-10-06 | Digital Equipment Corporation | Method and apparatus for automatically generating hierarchical categories from large document collections |
US6266681B1 (en) | 1997-04-08 | 2001-07-24 | Network Commerce Inc. | Method and system for inserting code to conditionally incorporate a user interface component in an HTML document |
US5940821A (en) * | 1997-05-21 | 1999-08-17 | Oracle Corporation | Information presentation in a knowledge base search and retrieval system |
US6271843B1 (en) | 1997-05-30 | 2001-08-07 | International Business Machines Corporation | Methods systems and computer program products for transporting users in three dimensional virtual reality worlds using transportation vehicles |
US6233575B1 (en) * | 1997-06-24 | 2001-05-15 | International Business Machines Corporation | Multilevel taxonomy based on features derived from training documents classification using fisher values as discrimination values |
US6094650A (en) * | 1997-12-15 | 2000-07-25 | Manning & Napier Information Services | Database analysis using a probabilistic ontology |
US5991714A (en) * | 1998-04-22 | 1999-11-23 | The United States Of America As Represented By The National Security Agency | Method of identifying data type and locating in a file |
US6389462B1 (en) | 1998-12-16 | 2002-05-14 | Lucent Technologies Inc. | Method and apparatus for transparently directing requests for web objects to proxy caches |
US6374241B1 (en) * | 1999-03-31 | 2002-04-16 | Verizon Laboratories Inc. | Data merging techniques |
US6665681B1 (en) * | 1999-04-09 | 2003-12-16 | Entrieva, Inc. | System and method for generating a taxonomy from a plurality of documents |
US6424982B1 (en) | 1999-04-09 | 2002-07-23 | Semio Corporation | System and method for parsing a document using one or more break characters |
US6401077B1 (en) | 1999-05-28 | 2002-06-04 | Network Commerce, Inc. | Method and system for providing additional behavior through a web page |
US6519586B2 (en) | 1999-08-06 | 2003-02-11 | Compaq Computer Corporation | Method and apparatus for automatic construction of faceted terminological feedback for document retrieval |
US6571240B1 (en) | 2000-02-02 | 2003-05-27 | Chi Fai Ho | Information processing for searching categorizing information in a document based on a categorization hierarchy and extracted phrases |
US6741981B2 (en) * | 2001-03-02 | 2004-05-25 | The United States Of America As Represented By The Administrator Of The National Aeronautics And Space Administration (Nasa) | System, method and apparatus for conducting a phrase search |
US6823333B2 (en) * | 2001-03-02 | 2004-11-23 | The United States Of America As Represented By The Administrator Of The National Aeronautics And Space Administration | System, method and apparatus for conducting a keyterm search |
US6697793B2 (en) * | 2001-03-02 | 2004-02-24 | The United States Of America As Represented By The Administrator Of The National Aeronautics And Space Administration | System, method and apparatus for generating phrases from a database |
-
1999
- 1999-04-09 US US09/289,174 patent/US6665681B1/en not_active Expired - Lifetime
-
2000
- 2000-04-06 JP JP2000611203A patent/JP2002541590A/en active Pending
- 2000-04-06 EP EP00921957A patent/EP1208464A4/en not_active Ceased
- 2000-04-06 AU AU42212/00A patent/AU4221200A/en not_active Abandoned
- 2000-04-06 WO PCT/US2000/009471 patent/WO2000062203A1/en active Application Filing
- 2000-04-06 CA CA002366545A patent/CA2366545C/en not_active Expired - Fee Related
-
2002
- 2002-11-29 HK HK02108694.0A patent/HK1047174A1/en unknown
-
2003
- 2003-11-10 US US10/704,138 patent/US7113954B2/en not_active Expired - Lifetime
Also Published As
Publication number | Publication date |
---|---|
AU4221200A (en) | 2000-11-14 |
EP1208464A1 (en) | 2002-05-29 |
CA2366545C (en) | 2005-12-20 |
WO2000062203A1 (en) | 2000-10-19 |
US6665681B1 (en) | 2003-12-16 |
US20040148155A1 (en) | 2004-07-29 |
EP1208464A4 (en) | 2006-04-05 |
JP2002541590A (en) | 2002-12-03 |
WO2000062203A9 (en) | 2002-06-13 |
US7113954B2 (en) | 2006-09-26 |
HK1047174A1 (en) | 2003-02-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CA2366545A1 (en) | System and method for generating a taxonomy from a plurality of documents | |
Soboroff et al. | Combining content and collaboration in text filtering | |
NZ502332A (en) | A text classification system and method for the analysis and management of text | |
Liu et al. | Cluster-based retrieval using language models | |
Bekkerman et al. | On feature distributional clustering for text categorization | |
Wartena et al. | Topic detection by clustering keywords | |
Mann | Fine-grained proper noun ontologies for question answering | |
EP2178008A3 (en) | Multi-modal information access | |
WO1997008604A3 (en) | Multilingual document retrieval system and method using semantic vector matching | |
Sornil et al. | An automatic text summarization approach using content-based and graph-based characteristics | |
Gelbukh et al. | Use of a weighted topic hierarchy for document classification | |
Fukumoto et al. | An automatic clustering of articles using dictionary definitions | |
De Moraes et al. | University of Houston@ CL-SciSumm 2018. | |
Grobelnik et al. | Efficient visualization of large text corpora | |
Chen et al. | A new differential LSI space-based probabilistic document classifier | |
Filatova et al. | Tell me what you do and I’ll tell you what you are: Learning occupation-related activities for biographies | |
Hernández-Reyes et al. | Document clustering based on maximal frequent sequences | |
Bollegala et al. | Measuring the similarity between implicit semantic relations using web search engines | |
Ahonen-Myka et al. | Data mining meets collocations discovery | |
van Halteren | New feature sets for summarization by sentence extraction | |
Wang et al. | Trajectory based word sense disambiguation | |
Une et al. | Induction of donor-specific unresponsiveness in NIH minipigs following intrathymic islet transplantation | |
Bennabi et al. | An Empirical Study on the effect of weighting schemes and Machine Learning algorithms on the Arabic text Classification | |
Kermanidis et al. | Combining language modeling and LSA on Greek song “words” for mood classification | |
SanJuan et al. | Combining vector space model and multi word term extraction for semantic query expansion |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
EEER | Examination request | ||
MKLA | Lapsed |
Effective date: 20190408 |