Methods and apparatus, including computer program products, to process an electronic document that includes a non-coded representation of characters of text. Based on text coding information that identifies the characters of the non-coded representation, a coded representation is generated and associated...http://www.google.de/patents/US7765477?utm_source=gb-gplus-sharePatent US7765477 - Searching dummy font encoded text