The present invention relates to a system and methodology to facilitate extraction of information from a large unstructured corpora such as from the World Wide Web and/or other unstructured sources. Information in the form of answers to questions can be automatically composed from such sources via probabilistic...http://www.google.de/patents/US20050033711?utm_source=gb-gplus-sharePatent US20050033711 - Cost-benefit approach to automatically composing answers to questions by extracting information from large unstructured corpora