WO2009105708A3 - Systems and methods of identifying chunks within multiple documents - Google Patents

Systems and methods of identifying chunks within multiple documents Download PDF

Info

Publication number
WO2009105708A3
WO2009105708A3 PCT/US2009/034771 US2009034771W WO2009105708A3 WO 2009105708 A3 WO2009105708 A3 WO 2009105708A3 US 2009034771 W US2009034771 W US 2009034771W WO 2009105708 A3 WO2009105708 A3 WO 2009105708A3
Authority
WO
WIPO (PCT)
Prior art keywords
document
systems
methods
multiple documents
chunk
Prior art date
Application number
PCT/US2009/034771
Other languages
French (fr)
Other versions
WO2009105708A2 (en
Inventor
Jeffrey M. Dexter
Robert Smik
Danny Hyun
Srinivasa R. Vegeraju
Ilesh H. Garish
Original Assignee
Tigerlogic Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US12/035,600 external-priority patent/US8126880B2/en
Priority claimed from US12/035,560 external-priority patent/US8001140B2/en
Priority claimed from US12/035,587 external-priority patent/US8924421B2/en
Priority claimed from US12/035,546 external-priority patent/US8078630B2/en
Priority claimed from US12/035,592 external-priority patent/US8001162B2/en
Priority claimed from US12/035,557 external-priority patent/US7933896B2/en
Priority claimed from US12/035,566 external-priority patent/US7937395B2/en
Priority claimed from US12/035,607 external-priority patent/US9129036B2/en
Priority claimed from US12/035,541 external-priority patent/US8145632B2/en
Priority claimed from US12/035,574 external-priority patent/US8359533B2/en
Priority claimed from US12/035,597 external-priority patent/US8924374B2/en
Priority to CA2716345A priority Critical patent/CA2716345A1/en
Application filed by Tigerlogic Corporation filed Critical Tigerlogic Corporation
Priority to AU2009217352A priority patent/AU2009217352B2/en
Priority to EP09712643.7A priority patent/EP2260417A4/en
Publication of WO2009105708A2 publication Critical patent/WO2009105708A2/en
Publication of WO2009105708A3 publication Critical patent/WO2009105708A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/338Presentation of query results
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking

Abstract

A computer identifies multiple resource identifiers, each resource identifier corresponding to a document at a respective data source. For at least one of the resource identifiers, the computer retrieves the corresponding document from the respective document source, identifies within the retrieved document a chunk that satisfies one or more user- specified search keywords, and displays the identified chunk and a link to the identified chunk within the document to the user.
PCT/US2009/034771 2008-02-22 2009-02-20 Systems and methods of identifying chunks within multiple documents WO2009105708A2 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
CA2716345A CA2716345A1 (en) 2008-02-22 2009-02-20 Systems and methods of identifying chunks within multiple documents
AU2009217352A AU2009217352B2 (en) 2008-02-22 2009-02-20 Systems and methods of identifying chunks within multiple documents
EP09712643.7A EP2260417A4 (en) 2008-02-22 2009-02-20 Systems and methods of identifying chunks within multiple documents

Applications Claiming Priority (22)

Application Number Priority Date Filing Date Title
US12/035,566 US7937395B2 (en) 2008-02-22 2008-02-22 Systems and methods of displaying and re-using document chunks in a document development application
US12/035,592 2008-02-22
US12/035,600 2008-02-22
US12/035,597 US8924374B2 (en) 2008-02-22 2008-02-22 Systems and methods of semantically annotating documents of different structures
US12/035,587 US8924421B2 (en) 2008-02-22 2008-02-22 Systems and methods of refining chunks identified within multiple documents
US12/035,592 US8001162B2 (en) 2008-02-22 2008-02-22 Systems and methods of pipelining multiple document node streams through a query processor
US12/035,560 US8001140B2 (en) 2008-02-22 2008-02-22 Systems and methods of refining a search query based on user-specified search keywords
US12/035,597 2008-02-22
US12/035,574 US8359533B2 (en) 2008-02-22 2008-02-22 Systems and methods of performing a text replacement within multiple documents
US12/035,600 US8126880B2 (en) 2008-02-22 2008-02-22 Systems and methods of adaptively screening matching chunks within documents
US12/035,587 2008-02-22
US12/035,541 US8145632B2 (en) 2008-02-22 2008-02-22 Systems and methods of identifying chunks within multiple documents
US12/035,546 2008-02-22
US12/035,574 2008-02-22
US12/035,546 US8078630B2 (en) 2008-02-22 2008-02-22 Systems and methods of displaying document chunks in response to a search request
US12/035,607 US9129036B2 (en) 2008-02-22 2008-02-22 Systems and methods of identifying chunks within inter-related documents
US12/035,557 US7933896B2 (en) 2008-02-22 2008-02-22 Systems and methods of searching a document for relevant chunks in response to a search request
US12/035,607 2008-02-22
US12/035,566 2008-02-22
US12/035,557 2008-02-22
US12/035,541 2008-02-22
US12/035,560 2008-02-22

Publications (2)

Publication Number Publication Date
WO2009105708A2 WO2009105708A2 (en) 2009-08-27
WO2009105708A3 true WO2009105708A3 (en) 2009-10-15

Family

ID=40986234

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2009/034771 WO2009105708A2 (en) 2008-02-22 2009-02-20 Systems and methods of identifying chunks within multiple documents

Country Status (4)

Country Link
EP (1) EP2260417A4 (en)
AU (1) AU2009217352B2 (en)
CA (1) CA2716345A1 (en)
WO (1) WO2009105708A2 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9305228B1 (en) 2015-03-20 2016-04-05 Bank Of America Corporation Processing damaged items using image data lift
US9679431B2 (en) 2015-04-15 2017-06-13 Bank Of America Corporation Detecting duplicate deposit items at point of capture
CN105740435A (en) * 2016-01-28 2016-07-06 安徽四创电子股份有限公司 On-line preview design method of document on the basis of distribution
US10394555B1 (en) 2018-12-17 2019-08-27 Bakhtgerey Sinchev Computing network architecture for reducing a computing operation time and memory usage associated with determining, from a set of data elements, a subset of at least two data elements, associated with a target computing operation result
US11546142B1 (en) 2021-12-22 2023-01-03 Bakhtgerey Sinchev Cryptography key generation method for encryption and decryption

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6732086B2 (en) * 1999-09-07 2004-05-04 International Business Machines Corporation Method for listing search results when performing a search in a network
WO2005041065A1 (en) * 2003-10-27 2005-05-06 Koninklijke Philips Electronics N.V. Screen-wise presentation of search results
US20060224554A1 (en) * 2005-03-29 2006-10-05 Bailey David R Query revision using known highly-ranked queries
US20070192293A1 (en) * 2006-02-13 2007-08-16 Bing Swen Method for presenting search results

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8150824B2 (en) * 2003-12-31 2012-04-03 Google Inc. Systems and methods for direct navigation to specific portion of target document
US20080027935A1 (en) * 2005-11-30 2008-01-31 Sahar Sarid Anchored search engine results display

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6732086B2 (en) * 1999-09-07 2004-05-04 International Business Machines Corporation Method for listing search results when performing a search in a network
WO2005041065A1 (en) * 2003-10-27 2005-05-06 Koninklijke Philips Electronics N.V. Screen-wise presentation of search results
US20060224554A1 (en) * 2005-03-29 2006-10-05 Bailey David R Query revision using known highly-ranked queries
US20070192293A1 (en) * 2006-02-13 2007-08-16 Bing Swen Method for presenting search results

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP2260417A4 *

Also Published As

Publication number Publication date
EP2260417A4 (en) 2016-08-24
CA2716345A1 (en) 2009-08-27
EP2260417A2 (en) 2010-12-15
AU2009217352A1 (en) 2009-08-27
WO2009105708A2 (en) 2009-08-27
AU2009217352B2 (en) 2015-02-12

Similar Documents

Publication Publication Date Title
WO2009131800A3 (en) Systems and methods of identifying chunks from multiple syndicated content providers
WO2011060231A3 (en) Method and system for grouping chunks extracted from a document, highlighting the location of a document chunk within a document, and ranking hyperlinks within a document
WO2009032107A3 (en) Document search tool
WO2012129149A3 (en) Aggregating search results based on associating data instances with knowledge base entities
CA2902821C (en) System for metadata management
WO2007146100A3 (en) Evaluative information system and method
WO2011019749A3 (en) Presenting comments from various sources
GB2508119A (en) System and method for content syndication service
NO20080376L (en) Ranking function that uses a predetermined click-through distance to a document on a network
EP3718324A4 (en) Methods, network function entities and computer readable media for data collection
WO2009081212A3 (en) Data normalisation for investigative data mining
WO2013006422A3 (en) Systems and methods for creating an annotation from a document
GB2484019A (en) An integrated approach for deduplicating data in a distributed environment that involves a source and a target
WO2010080591A3 (en) Methods and apparatus for content-aware data partitioning and data de-duplication
WO2012154501A3 (en) Hybrid web container for cross-platform mobile applications
WO2011019877A3 (en) Context based resource relevance
GB201209093D0 (en) Method of searching for document data files based on keywords,and computer system and computer program thereof
WO2011066456A3 (en) Methods and systems for content recommendation based on electronic document annotation
WO2009042911A3 (en) Search based data management
WO2009105708A3 (en) Systems and methods of identifying chunks within multiple documents
GB2500160A (en) Replicating data
GB2540700A (en) Merging multiple point-in-time copies into a merged point-in-time copy
WO2011088521A3 (en) Improved searching using semantic keys
WO2009158664A3 (en) Library description of the user interface for federated search results
WO2013055636A3 (en) Recommending data based on user and data attributes

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 09712643

Country of ref document: EP

Kind code of ref document: A2

WWE Wipo information: entry into national phase

Ref document number: 2716345

Country of ref document: CA

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 2009217352

Country of ref document: AU

REEP Request for entry into the european phase

Ref document number: 2009712643

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2009712643

Country of ref document: EP

ENP Entry into the national phase

Ref document number: 2009217352

Country of ref document: AU

Date of ref document: 20090220

Kind code of ref document: A