Systems and methods for analyzing HTML-formatted web pages to automatically identify and extract desired information. A computer algorithm automatically identifies and extracts different pieces of information from different web pages after minimal manual setup. The algorithm automatically analyzes pages...

Patent US20050273706 - Systems and methods for identifying and extracting data from HTML pages
http://www.google.de/patents/US20050273706?utm_source=gb-gplus-share
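
As a rough illustration of the general idea in the abstract (a small manual setup, then automatic extraction from HTML), the sketch below uses Python's standard `html.parser` to collect the text of elements that match a configured tag and class. This is not the method claimed in the patent, whose details are cut off above. The names `FieldExtractor` and `extract`, the `price` class, and the sample markup are all hypothetical.

```python
from html.parser import HTMLParser


class FieldExtractor(HTMLParser):
    """Hypothetical helper: collect text inside elements with a given tag and class."""

    def __init__(self, tag, css_class):
        super().__init__()
        self.tag = tag              # the "manual setup": which tag to look for
        self.css_class = css_class  # ...and which class marks the desired field
        self.depth = 0              # > 0 while inside a matching element
        self.current = []           # text fragments of the element being read
        self.results = []           # extracted values, in document order

    def handle_starttag(self, tag, attrs):
        if self.depth:
            # Track nested elements of the same tag so the right end tag closes us.
            if tag == self.tag:
                self.depth += 1
            return
        classes = (dict(attrs).get("class") or "").split()
        if tag == self.tag and self.css_class in classes:
            self.depth = 1
            self.current = []

    def handle_endtag(self, tag):
        if self.depth and tag == self.tag:
            self.depth -= 1
            if self.depth == 0:
                self.results.append("".join(self.current).strip())

    def handle_data(self, data):
        if self.depth:
            self.current.append(data)


def extract(html, tag, css_class):
    """Return the text of every <tag class="... css_class ..."> element in html."""
    parser = FieldExtractor(tag, css_class)
    parser.feed(html)
    parser.close()
    return parser.results


if __name__ == "__main__":
    page = '<div><span class="price">$10</span><span class="name">Widget</span></div>'
    print(extract(page, "span", "price"))
```

The same configured extractor can be run over many pages that share a layout, which is the sense in which a single setup step is reused. A real system would also need to cope with layout differences between pages, which this sketch does not attempt.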