WO2001075678A1 - System and method for evaluating an optimizing web site attributes - Google Patents

System and method for evaluating an optimizing web site attributes Download PDF

Info

Publication number
WO2001075678A1
WO2001075678A1 PCT/US2001/010862 US0110862W WO0175678A1 WO 2001075678 A1 WO2001075678 A1 WO 2001075678A1 US 0110862 W US0110862 W US 0110862W WO 0175678 A1 WO0175678 A1 WO 0175678A1
Authority
WO
WIPO (PCT)
Prior art keywords
source document
usability
requirement
attribute
web site
Prior art date
Application number
PCT/US2001/010862
Other languages
French (fr)
Inventor
Paolo A. Brajnik
Original Assignee
Usablenet, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Usablenet, Inc. filed Critical Usablenet, Inc.
Priority to AU2001251281A priority Critical patent/AU2001251281A1/en
Publication of WO2001075678A1 publication Critical patent/WO2001075678A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking

Definitions

  • the present invention is directed generally to document processing, and more particularly to evaluating and optimizing the attributes of web site documents.
  • Web sites As used herein, generally refers to one or more interrelated web pages. Although the computer languages for generating these web sites are largely standardized (i.e. HTML, XML, JAVA), the methods for generating them are not. As a result, attributes such as ease-of-use, appearance and navigability may vary greatly from web site to web site.
  • Many operators of commercial web sites generate revenue or receive revenue which in some way corresponds to the number of visitors who access their web sites. For example, advertisers may pay a web site operator to display banner ads to visitors. The price to be paid by the advertisers may be determined based on the average number of visitors to the site per day.
  • the operator of a web site may offer to sell any number of products or services through the site.
  • One or more web pages are presented which include pictures and or textual descriptions of the products and/or services offered. For sales to be accomplished, visitors to the web site will generally need to view these pages before purchasing the product or service. This, in turn, requires that the visitors stay at the web site long enough to view the product/service pages and make the decision to purchase the offered products and services.
  • Web sites which demonstrate poor ease-of-use, appearance and navigability generally do not receive as many visitors as web sites which are considered user-friendly. Such poorly designed sites also retain fewer visitors for a shorter period of time. As a result, web site operators who maintain poorly designed web sites may not be able to effectively compete on the Internet due to the lower revenue generation which results from lower visitation.
  • hypertext markup language (HTML) validation programs have been produced. The purpose of these programs is to ensure that the commands found in the source document of each web page conform to a specific version of HTML.
  • link validating program have been developed which confirm that hypertext links which may placed on a web site to direct a user to a web page that is external to the current web site are valid.
  • these programs do not contemplate reviewing multiple attributes of web sites. These programs further do not contemplate rating multiple attributes of a web site. Also, such programs fail to present suggested corrections for web design errors.
  • the present application is directed to particular features of a method and system for evaluating and optimizing web site attributes.
  • the present invention includes a first method for evaluating a web site.
  • a server or a software program of the present invention receives a source document corresponding to a web site from a user.
  • the source document may have a plurality of tags.
  • the system compares at least one of the plurality of tags to a usability rule and generates a usability rating based on the comparison.
  • a server or program of the present invention receives an identification of a source document corresponding to a web site and a user request to compare the source document to a usability rule.
  • a method and accompanying apparatus for for increasing a usability of a web site begins when a user identifies a web site having a plurality of attributes and sends a request for a comparison of the web site to a plurality of usability rules.
  • a server or computer then presents an identification of a problem with at least one of the attributes in response to the request.
  • FIG. 1 is a block diagram illustrating an exemplary computer network in accordance with an embodiment of the invention
  • FIG. 2 is a block diagram of exemplary components of a server for use with the system of FIG. 1;
  • FIGS. 3 and 4 are a flowchart of an exemplary comparison process in accordance with an embodiment of the present invention.
  • FIGS. 5 and 6 are exemplary screen displays depicting selectable usability rules and variable weights according to one embodiment of the present invention.
  • FIG. 7 is an exemplary screen display of a menu for initiating a web site attribute evaluation according to one embodiment of the present invention
  • FIG. 8 is an exemplary screen display depicting an entry field for designating a source document to review according to one embodiment of the present invention
  • FIG. 9 is an exemplary screen display of a source document to be evaluated according to one embodiment of the present invention.
  • FIG. 10 is an exemplary screen display showing the result of a web site evaluation according to one embodiment of the present invention.
  • FIG. 11 is an exemplary screen display showing an explanation of an identified problem with a web site according to one embodiment of the present invention.
  • FIG. 12 is an exemplary screen display showing a description and suggested correction of an identified problem with a web site according to one embodiment of the present invention.
  • FIG. 13 is an exemplary usability rating generated according to one embodiment of the present invention.
  • FIGS. 1-13 wherein similar components of the present invention are referenced in like manner, preferred embodiments of a method and system for evaluating and optimizing web site attributes are disclosed.
  • FIG. 1 there is depicted an exemplary computer network 1 through which a plurality of remote devices 2, 4, 6 may communicate with a server 16 via network connection 10 in any known manner.
  • computer network 1 is preferably an Internet- based network such as the World Wide Web, it may be any one or more of a local area network (LAN), a wide-area network (WAN), an intranet environment, an extranet environment, a wireless network or any other type of computer network, such as those enabled over public switched telephone networks.
  • LAN local area network
  • WAN wide-area network
  • intranet environment such as those enabled over public switched telephone networks.
  • extranet environment such as those enabled over public switched telephone networks.
  • wireless network any other type of computer network, such as those enabled over public switched telephone networks.
  • Remote terminals 2, 4, 6 may each be any type of computing device, such as a personal computer, a workstation, a network terminal, a hand-held remote access device or any other device that can communicate with the server 16 over the network connection 10. Users may interact with the server 16 in order to submit source document information and the like and receive a usability rating for a web site corresponding to the source document.
  • Server 16 in turn, may include any number of computer servers operative to evaluate source document information in accordance with the present invention. Specific functions and operations of serverl ⁇ are discussed further below.
  • FIG. 2 displayed therein are exemplary components of a computing device, such as server 16. It should be understood that any of customer terminals 2, A and 6 may share similar configurations. However, for sake of brevity, the discussion immediately below will refer to the server 16 only.
  • the primary component of the server 16 is a processor 20, which may be any commonly available microprocessor, such as the PENTIUM III manufactured by INTEL CORP.
  • the processor 20 may be operatively connected to further exemplary components, such as RAM/ROM 26, a clock 28, input/output devices 30, and a memory 22 which, in turn, stores one or more computer programs 24.
  • the processor 20 operates in conjunction with random access memory and read-only memory in a manner well known in the art.
  • the random-access memory (RAM) portion of RAM/ROM 26 may be a suitable number of Single In-line Memory Module (SIMM) chips having a storage capacity (typically measured in kilobytes or megabytes) sufficient to store and transfer, inter alia, processing instructions utilized by the processor 20 which may be received from the application programs 24.
  • the read-only memory (ROM) portion of RAM/ROM 26 may be any permanent non-rewritable memory medium capable of storing and transferring, inter alia, processing instructions performed by the processor 20 during a start-up routine of the server 12.
  • the clock 28 may be an on-board component of the processor 20 which dictates a clock speed (typically measured in MHz) at which the processor 20 performs and synchronizes, inter alia, communication between the internal components of the server 12.
  • the input/output device(s) 30 may be one or more commonly known devices used for receiving operator inputs, network data, and the like and transmitting outputs resulting therefrom. Accordingly, exemplary input devices may include a keyboard, a mouse, a voice recognition unit and the like for receiving operator inputs. Output devices may include any commonly known devices used to present data to an operator of the server 12 or to transmit data over the computer network 1 to a remote user or customer. Accordingly, suitable output devices may include a display, a printer and a voice synthesizer connected to a speaker.
  • Other input/output devices 30 may include a telephonic or network connection device, such as a telephone modem, a cable modem, a T-l connection, a digital subscriber line or a network card, for communicating data to and from other computer devices over the computer network 1 , such as remote terminal 2.
  • a telephonic or network connection device such as a telephone modem, a cable modem, a T-l connection, a digital subscriber line or a network card, for communicating data to and from other computer devices over the computer network 1 , such as remote terminal 2.
  • the communications devices used as input/output devices 30 have capacity to handle high bandwidth traffic in order to accommodate communications with a large number of users.
  • the memory 22 may be an internal or external large capacity device for storing computer processing instructions, computer-readable data, and the like. The storage capacity of the memory 22 is typically measured in megabytes or gigabytes.
  • the memory 22 may be one or more of the following: a floppy disk in conjunction with a floppy disk drive, a hard disk drive, a CD-ROM disk and reader/writer, a DVD disk and reader/writer, a ZIP disk and a ZIP drive of the type manufactured by IOMEGA CORP., and/or any other computer readable medium that may be encoded with processing instructions in a read-only or read-write format. Further functions of and available devices for memory 22 will be apparent.
  • the memory 22 preferably stores, inter alia, a plurality of programs 24 which may be any one or more of an operating system such as WINDOWS 2000 by MICROSOFT CORP, and one or more application programs, such as a web hosting program, which may be necessary to implement the embodiments of the present invention.
  • the programs 24 preferably include evaluation software for evaluating and optimizing web site attributes in accordance with the present invention.
  • the process 31 begins when a user identifies a web site or presents one or more source documents to be evaluated (step 32). This may be accomplished by initiating evaluation software which implements the process 31.
  • the evaluation software may present a screen display 114 in which a dialog box 112 is presented for entering an identification of the web site or files containing source document data to be evaluated, as depicted in FIG. 8.
  • the source document 118 may be written in any suitable format or language, such as HTML, XML and CSS.
  • the corresponding web site may be an e- commerce web site, or any other type of web site which has user-accessible content presented in one or more inter-related source documents 118.
  • a user 2 may transmit a URL or other identifier to a remote server 16 which, in turn, downloads source data from a second server (not shown) that hosts the web site corresponding to the URL.
  • the user 2 may transmit source document data 118 (as shown in screen display 116 of FIG. 9) corresponding to the web site directly to the server 16 from one or more computer-readable files stored in a memory of the user terminal 2.
  • a user 2 may have the evaluation software installed on the server hosting the web site (not shown), in which case network communication over network connection 10 is not necessary. Also, the user 2 may install the evaluation software on a standalone computer and load the source document data from a computer-readable medium.
  • a screen display 100 of exemplary rules are provided in FIGS. 5 and 6.
  • the rules may be presented according to a number of headings 102, such as page rules, graphics rules, navigation rules and structure rules. Other headings or fewer headings 102 may be provided.
  • the screen display may also present individual rule titles 104 which generally describe the rule to be applied.
  • Each rule may further have a weight 106 associated therewith. The weight 106 is used to generate a usability rating as described further below. The weights may be adjusted by selecting the weight 106 and inputting a new number variable as displayed according to checkbox 106 of FIG. 6.
  • weights are to be applied on a scale of one to five. However, other weight systems may be used.
  • a number of checkboxes 105 may be presented by which a user may designate whether particular rules or groups of rules are to be applied. It is further contemplated that a user 2 may define and incorporate their own rules into the evaluation software.
  • the rules to be employed as well as the statistical weights to be applied thereto are based on user surveys in which participants are asked to rate various existing web sites according to ease-of-use, navigability and appearance.
  • An analysis of the questionnaires may provide a guideline to identifying and quantifying those attributes which negatively impact a user's perception of the web site.
  • Other criteria for determining problems, statistical weights, and the like may be used.
  • the processing instructions of the evaluation software next direct the processor 20 of the server 16 to parse each received source document 118 to generate a corresponding tag list, which includes the document tags stored in each source document 118 (step 36).
  • the tags may be written in a format that corresponds to standard web document languages such as HTML or XML.
  • the parsing of a source document may involve transforming a stream of computer-readable data (e.g., representing HTML code) into a useable data structure which is then stored in an accessible memory, such as memory 32 or the RAM component of RAM/ROM 26.
  • Such a data structure preferably represents the content of the page and is optimized for access speed.
  • the evaluation program may then query the data structure to determine what web site attributes are present in the source document 118.
  • the evaluation software generates a model of the entire web site based on the source documents received (step 38).
  • the evaluation software constructs a model of the web site which includes, for example, the interconnections among the various pages of the web site.
  • the site model designates which page corresponds to the home page of the web site and further identifies which is the current web page chosen for the evaluation.
  • the site model is a graphic depiction representing all possible connections between any two pairs of pages in the web site.
  • connections are preferably determined from HTML links presented in the source documents as well as anchors, included images, frames, automatic refresh pages, background images, image map links, and the like included in the web site.
  • the site model particularly helps determine problems with respect to site rules and the like as described further below.
  • the evaluation software may then direct the processor 20 to compare the attributes from the tag list corresponding to a source document 118 to the stored usability rules (step 40).
  • Each usability rule is preferably a set of processing instructions that are applied to each page systematically and which may be applied in any order. Such processing instructions direct the processor 20 to collect needed information from the tag list corresponding to the source document 118, and from the site model.
  • the pairwise rules are generally used to check the consistency between a page chosen as a model and each other page that should conform to the former one. Checking is performed by extracting several features of the model page and verifying that the remaining pages have them as well.
  • the processing instructions may direct the processor 20 to compare tag list data to a specific standard and if the standard is not met, a usability problem is generated and identified to the user (step 42). According to the degree of conformance of such a data with respect to the rule and the weight identified with the rule, the problem is assigned to a severity level.
  • a screen display 120 reporting detected web site attribute problems is depicted in FIG. 10.
  • the evaluation software may present a user 2 with a list of pages 124 from the identified web site 122.
  • a first column 126 presents the number of detected problems associated with each page and the value 128 representing the average number of problems per page.
  • a second column 130 may present a severity rating associated with each page which may be determined based upon the number of problems detected and the weights of the detected problems. The severity may also be represented in a graphical format. The second column may also include the average severity of the web site 132, which may in turn, be used to generate a usability rating for the web site, in any useful manner.
  • a third column 134 may further be presented which lists the number of problems fixed and the number of problems originally present.
  • the user 2 may indicate whether problems identified by the evaluation software are to be corrected (step 44). If so, the process 31 continues to step 46 below. Otherwise the process continues step 54, also described below.
  • the user 2 may select a page listed in column 124 of screen display 120.
  • the evaluation software may then present a dialog box, such as dialog box 136 of FIG. 11, which lists the individual problems associated with the selected page.
  • the dialog box 136 may further contain a window 138 for presenting a description 142, an explanation 139 and one or more suggested corrections 144 for the one or more listed problems.
  • the user 2 may then decide if one or more of the suggested corrections should be implemented (step 46). If the user 2 decides to use the suggested corrections, the user 2 may activate a fix problem command 145, after which the evaluation software inserts the suggested corrections into the applicable source document 118 (step 50). If, however, the user 2 wishes to correct the attributes manually, the user 2 may instead select the edit page command 146 and insert desired replacement code in the source document 118 (step 48). Steps 44-50 may be repeated iteratively, as needed, for each problem identified with the subject web site. Next, the user 2 may decide if the web site is to be re-evaluated (step 52). if so, the process 31 returns to step 40 above.
  • the process 31 continues to step 54 wherein the evaluation software generates a usability rating of the web site.
  • the usability rating is determined in any useful manner by considering the number of problems associated with the web site, the weights associated with those problems, and a percentage of the web site for which there are no demonstrable problems. Other factors may be used to generate the usability rating.
  • the usability rating may take the form of a percentage, but may alternately be described in terms of a number scale, a letter grade or any other useful indicium of web site ease-of-use, navigability, appearance, and/or user-friendliness.
  • the usability rating may be generated by the number of pages having problems with a high weight or severity in comparison to the number of pages of the web site.
  • usability ratings may be assigned in the following manner.
  • a rating of 100% would be assigned to a web site which does not demonstrate catastrophic problems (i.e. those problems having a highest weight), does not demonstrate major problems (those having an intermediate weight) and has less than a predetermined number of minor problems per page(those with the lowest weights).
  • a web site would receive a usability rating of 80% when there are no catastrophic problems, less than a predetermined number (e.g. 3) of major problems per page, and less than a predetermined number (e.g. 10) of minor problems per page.
  • a usability rating of 60% would be applied a web site which contains no catastrophic problems, less than a predetermined number (e.g.
  • a 40% rating would be applied to web sites which do not demonstrate catastrophic problems, but demonstrate higher than predetermined numbers of major and minor problems.
  • a 0% rating would be applied to a web site demonstrating catastrophic problems. Other rating systems may be used.
  • the user 2 may decide whether to place the rating on the web site (step 56). This may be accomplished by inserting appropriate web page formatting commands into a source document of the web site. An example of a displayed usability rating 160 is presented in FIG. 13. If the user 2 decides to insert the rating, the process 31 continues to step 58 where the appropriate instructions are inserted into the source document 118, after which the process 31 ends. If, on the other hand, the user 2 decides not to insert the rating, the process 31 ends without performing step 58
  • the following are rule titles corresponding to rules which may be employed by a method or apparatus of the present invention. Additional rules may be used and fewer rules may be employed.
  • Preferred rules include: (1) Body Color Portability, (2) Portable Font Color, (3) Known Body Color, (4) Known Font Color, (5) No Blinking, (6) Headings Exist, (7) Body Consistency, (8) Body Color Consistency, (9) Consistent Link Colors, (10) Noframes Exists, (11) Noframes With Content, ( 12) Noframes Links, (13) IFRAME Exists, ( 14) Frame Borders, (15) Frame Margin, (16) Frame Targetting, (17) Noresize Frame, (18) Frames Are Bad, (19) Page Size Small, (20) IMG with ALT, (21) Img Alt Short, (22) ALT With Content, (23) IMG With Size, (24) Image Stretching, (25) Forced Download Images, (26) Background Consistency, (27) Description Analysis, (28) Keywords Analysis, (29) Title Analysis, (30) Explicit Mail To, (31 ) VLINK_differs_from_LINK, (32) Non-Standard LINK, (33) Empty Anchor Links, (34) Consistent Navigationals
  • the rule relating to Body Color Portability checks that colors specified in a BODY element of the source document for foreground or background are recognizable by various versions of standard web browsers. Such colors may be those belonging to a predefined list of color names or those specified as RGB values belonging to a specific subset. Using non-standard colors may lead to different renderings between different browsers on displays not supporting 24-bit colors (i.e. millions of colors), thus reducing the graphical quality of the site.
  • the evaluation software To correct a detected problem with this attribute, the evaluation software first extracts the colors specified in the BODY HTML elements of the source document 118 and compares them with the predefined list of acceptable colors or to standard RGB formats. The software then signals a problem for each page that uses colors not satisfying these constraints. Possible fixes for this problem include presenting an interactive tool to choose an acceptable color to substitute for the original, problematic color listed.
  • Portable colors are those belonging to a predefined list of color names or those specified as RGB values belonging to a standard format. Using non-standard colors may lead to differing presentations of a web site among different types of web browsers, especially those which do not support 24-bit colors. This, in turn, may reduce the graphical quality of the web site.
  • the evaluation software compares these colors with a predefined list of standard colors and/or the standard RGB formats. The software signals a problem for each page that contains FONT elements with colors not satisfying those constraints. The software may then suggest a fix for this problem by presenting an interactive tool to choose an acceptable color to substitute for the original color.
  • the list of VGA colors is constituted by 16 color names that all browsers understand. Using symbolic names for colors, except for recognized VGA colors, may lead to different renderings between different browsers, and some may not even be recognized, thus reducing the graphical quality of the site.
  • the evaluation software extracts the colors specified as attributes of the BODY HTML element in the page and compares them with the predefined list and the VGA standard. The software then identifies a problem for each page that uses colors not satisfying these constraints. The software may then suggests a fix for the problem by presenting an interactive tool to choose an acceptable to substitute for the original color.
  • the rule relating to Known Font Color checks that colors specified in a ⁇ FONT
  • the list of VGA colors is constituted by 16 color names that all browsers understand. Using symbolic names for colors, except for the VGA ones, may lead to different renderings between different browsers. Some names may not even be recognized, thus reducing the graphical quality of the site.
  • the rule relating to No Blinking checks that the page does not contain the BLINK element of an HTML document.
  • the BLINK element though being part of the HTML 4.0 standard, is considered an undesirable element because surveys have indicated that blinking animated features on a computer screen have a strong negative impact on a user's peripheral visual perception. In other words, a user's attention is constantly distracted away from the interesting parts of the page. It has been further shown that distracted users will move quickly to other sites, thus decreasing visitor retention times.
  • the evaluation software extracts from the page all the BLINK elements and generates a problem for all of them.
  • the software suggests to fix the problem by substituting the BLINK element with other means to emphasize text, including, for example, inserting a new color, increasing font size, using boldface (e.g. by employing a ⁇ STRONG> tag), or by using italics (e.g. by employing an ⁇ EM> tag).
  • Headings ⁇ H1>, ⁇ H2>, ⁇ H3> are used in html pages to structure text into blocks.
  • the advantages of using headings are: (1) the text may be easier to scan and read (2) the text may be more effectively rendered by speaking machines, and (3) the heading text may used when a search engine indexes the page.
  • the evaluation software first counts the words that constitute the content of a page. If the resulting number is above a predefined threshold, then it extracts all the heading elements. If none are found then a problem is generated. The software then suggests to fix the error by splitting the page content into sections and using heading elements to specify sections titles.
  • the rule relating to Body Consistency is a pairwise rule that checks that the subject page contains a particular body element if such element is found in the model page. If this is not true, then the two pages have a different structure that breaks consistency. Changing structure between pages may confuse readers to the point that they may not know if they are still visiting the same site, or if they got the page they requested. In order to address this problem, the evaluation software extracts BODY elements from the two pages and compares results. If the compared elements are different then a problem is generated. The software then suggests that the problem be fixed by making the subject web page use exactly the same structure as that used by the model page.
  • the rule relating to Body Color Consistency is a pairwise rule that checks that a model page, such as a home page of the web site, and the current page specify the same color for the following components: background, foreground text, visited links, unvisited links and active links. Changing colors between pages may confuse readers to the point that they may not know if they are still visiting the same site, if they got the page they wanted, or if the meaning of colored objects has changed.
  • the evaluation software extracts from the model and the current page the color specified for the considered components. If they differ (or only one of the two pages specify it) then a problem is generated. The evaluation software then suggests that the user replace, add or remove the color used in the current page, accordingly.
  • the rule relating to Consistent Link Colors is a pairwise rule that checks if individual instances of links have colors that are inconsistent between the model and the current pages. Inconsistent use of link colors is a source of confusion for users because they may not easily realize that words printed in a new color link to the same type as those that they have previously seen. In order to address this problem, the rule extracts FONT elements from both pages and the colors specified in those FONT elements. If the two pages differ for the same link type, then the evaluation software signals a problem.
  • the problem may be fixed through a dialog box that shows which elements in the model page use which color and that highlights those elements in the current page using a color not appearing in the model.
  • the rule relating to No Frames checks that if the page contains a FRAMESET element (that is, an element specifying that the page should have a frame-based structure), then the page should also contain a NOFRAMES element. This is because a frameset without ⁇ NOFRAMES> ... ⁇ NOFRAMES> will not be displayed by certain browsers which can not render frames.
  • Such browsers include early versions of well-known browsers as well as certain modern specialized browsers, like speaking machines, browsers for PDAs, and text-only browsers.
  • the evaluation software determines whether the source e document 118 includes a FRAMSET and, if so, whether a NOFRAMES element exists. If it does not exist, then a problem is generated. The software then suggests to fix the problem by adding a NOFRAMES element.
  • a suggestion presented for inserting such a NOFRAMES element may be similar to the following:
  • the rule relating to No Frames With Content checks that if the page contains a NOFRAMES element, then it should contain meaningful text that is to be substituted for the frame, rather than sentences like 'you need a frame enabled browser to view this page'.
  • the NOFRAMES element is intended to be used to provide users of browsers that are not frame- enabled the means to access the site, although with a slightly degraded graphic and interaction quality.
  • a NOFRAMES element containing text stating that the page cannot be rendered by the browser being used does not help the user to access the page content that is framed.
  • the evaluation software collects all the text included in the NOFRAMES element, extracts the words that appear more frequently and removes from the resulting list of words other words that are not significant (like 'browser', 'frame', 'download', etc.). If less than a predefined proportion (e.g. 50%) of the words remain, then a problem is generated since too few significant words have been found.
  • the rule extracts the most frequent words contained in the pages that constitute the FRAMESET components of the current page and shows them to the user. The user can then read those words and perhaps use them to fill the content of the NOFRAMES element.
  • the rule entitled No Frames Links checks that if a page that contains a FRAMESET, and further contains a NOFRAMES element, then the NOFRAMES element should include many of the links contained in the framed pages. This is because the NOFRAMES element is intended to provide users of browsers that are not frame-enabled the means to access the site anyway, although perhaps with a slightly degraded graphic and interaction quality. A NOFRAMES element that does not contain links, will not permit navigation from the page, thus unintentionally preventing a user from exiting the page without deactivating their browser.
  • the evaluation software extracts two lists of links: those connecting the NOFRAMES element with other pages and those connecting the framed pages with other pages. It then compares the lists and if less than, for example, 70% of the links found in framed pages occur in the NOFRAMES element, then a problem is generated. Since it is necessary to determine where in the web page to place the missing links, the user must generally provide a manual fix to this problem.
  • IFRAME Exists checks if an IFRAME element is present in the page. Though included in the HTML 4.0 standard, the IFRAME element (referring to "inline frame") is not supported by NETSCAPE NAVIGATOR versions 2, 3, or 4.1. Nor is it supported by INTERNET EXPLORER verson 2. Thus, users who surf the net with such browsers may not be able to correctly see the page. In order to correct this problem, the evaluation software extracts any IFRAME elements from the page. If some are found, then a problem is generated. Since there is no easy way to simulate the effect of IFRAME using other standard HTML elements, the software suggests that the user fix the problem by either reorganizing the page without using IFRAME or providing an alternative route for users not using INTERNET EXPLORER 3+.
  • the rule entitled Frame Borders checks if the page specifies frame borders in a way that is incompatible between the two major browsers in the market: NETSCAPE NAVIGATOR and INTERNET EXPLORER. This is because these browsers use different HTML coding for determining whether frame borders should be drawn, and the width of the frame.
  • a FRAMESET or FRAME element that uses only one type of coding will be properly viewed only on one type of browser. Thus, users surfing with other than that browser will not receive a properly rendered page.
  • FRAMEBORDER no
  • NETSCAPE NAVIGATOR 3+ and INTERNET EXPLORER 4+ use the format:
  • FRAMEBORDER no
  • the evaluation software checks what combination of HTML code is used to specify frame borders. In particular it checks if apparently redundant attributes are used that can be understood by both major browsers. The code that guarantees that both browser will work is displayed below:
  • the evaluation software may offer a dialog box showing what kind of attributes are used in the page and highlighting what other attributes should be added to guarantee full compatibility with current major browsers.
  • the dialog box also provides the user with the ability to automatically add the missing attributes.
  • the rule entitled Frame Margin determines whether the page specifies frame margins in a way that is compatible with major browsers in the market, for example, MARGINHEIGHT and MARGINWIDTH can be used to specify the size of top/bottom and left/right margins (e.g., the distance between text and border of window).
  • MARGINHEIGHT and MARGINWIDTH can be used to specify the size of top/bottom and left/right margins (e.g., the distance between text and border of window).
  • the rule entitled Frame Targeting concerns a problem where pages linked from within a frame may be displayed in different windows.
  • the TARGET tag in HTML is one attribute controlling such an effect.
  • the evaluation software collects all the links leaving the page, if the page is rendered within a frameset. It then removes from such a list those links pointing towards other framed pages belonging to the same frameset. It further removes those links that are not important. (Important links are those pointing to pages internally and to pages external to the site that are often used within the site. In practice, important links are determined by scanning all the pages of the site, collecting the frequencies with which their destination URLs are used, and selecting those most frequently used. Links connecting two frames are generally not considered.) Next, the rule extracts the TARGET attribute for such links. If the
  • TARGET is '_self or is missing, then a problem is generated (because the link destination will be rendered by browsers within the current frame). If it is '_blank' then another kind of problem is generated. That is, a new browser window will be opened for rendering the link destination. In order to fix this problem, the evaluation software opens a dialog box which shows the list of links having wrong targets suggesting a correct target.
  • the rule entitled Noresize Frame confirms that the FRAME elements of the subject source document 118 do not contain the attribute NORESIZE. This is because users may want to move frames borders to better render web site content within their browser. Often this is needed because the configuration of their browser (e.g., font size, image downloading, etc.) may lead to an inconvenient layout of the frame. By specifying NORESIZE, the width or height of a frame is frozen, thus reducing user flexibility.
  • the evaluation software collects the FRAME elements contained in the page and checks if the NORESIZE attribute is set. If so, it identifies the problem to the user. A dialog box is then opened which shows the non-resizable frames and offers to automatically remove that attribute.
  • the rule entitled Frames are Bad determines whether frames are present in a page and if so simply gives a warning message to the user. Frames suffer from a number of usability drawbacks, such as: (1) framed pages cannot be bookmarked ; (2) users often get confused as to which frame is active; (3) frames cannot be rendered by specialised browsers, like text-based or speaking machines; and (4) framed page are often not indexed by search engines, as they have no recognizable content.
  • Page Size Small determines whether the page can be quickly downloaded by visitors. Many visitors use slow internet connections and will be inconvenienced if the download time for a page is too long.
  • the evaluation software determines the size of all the files involved in the page (e.g., background and other images, framed pages, etc.), computes a sum and then computes the estimated download time for website users that may use different modem speeds (e.g. 14.4, 28.8, 33.6, 56.0 kbps). Special attention is paid if the page being analyzed is a home page of the web site, since this will be the first page a visitor is presented with. If a download time for the page exceeds a predetermined or user-selected threshhold, the evaluation software recommends a reduction in the size of the page. The software may further highlight the size of particular page components, such as images, in order to assist the user in limiting the page size.
  • modem speeds e.g. 14.4, 28.8, 33.6, 56.0 kbps.
  • the rule entitled IMG With ALT determines whether images embedded in a page contain an alternative textual description. Images are not always downloaded by users and rendered by browsers. Image downloading may be disabled in many web browsers in order to increase an access to a web site. Thus, providing an alternative textual description of the image with the ALT attribute helps these users to determine what kind of image they missed and might be interested in. Furthermore, ALT descriptions are usually downloaded and rendered immediately by browsers, whereas the image itself might require several seconds to download and render. Finally, non graphically capable browsers (speaking machines or text-based browsers), are still able to provide some information to users. In order to address this problem, the evaluation software extracts all the IMG elements from a page and then checks if they specify an ALT attribute.
  • ALT string with suitable text If not, a problem is generated and the software provides a recommendation that an ALT string with suitable text be inserted.
  • the rule entitled IMG Alt Short determines whether images embedded in page have an alternative textual description that is not too long. Long strings will typically be truncated by browsers or won't be rendered at all. Therefore, long ALT strings contribute negatively to the download time of a web page.
  • the absolute limit of an ALT string is 1024 characters, but it is reasonable for a web designer to adopt a much smaller limit.
  • the evaluation software presents a dialog in which the ALT string of each collected image is shown, together with the image itself, and the user can edit or reword the string so that it becomes shorter. Then the new string can be saved attached to its image in the current page or in all the occurrences of that image in other pages of the site.
  • the rule entitled ALT With Content determines whether the alternative string associated with an image does not consist of words that simply suggest the user to use a graphic-enabled browser.
  • the ALT attribute should be used to convey some information about the content of the associated graphics.
  • non-graphical browsers e.g., text-based, speaking machines, browsers with image download disabled and browsers for blind people
  • search engines can better index the page.
  • these strings are extracted from the page and words belonging to a specific subset of generic terms (e.g., 'browser', 'graphic', 'enable') are removed.
  • the number of remaining words are compared to the original size of the string, and if the ratio is below a predefined threshold, then a problem is signaled.
  • the software then presents a dialog box in which the ALT string of each collected image is shown, together with the image itself.
  • the user can then edit or re-word the text so that it becomes shorter.
  • the new string can be saved attached to its image in the current page, as well as in all the occurrences of that image in other pages of the sites.
  • the rule entitled IMG With Size determines whether images that are embedded in the current page declare their sizes in HTML. It is important to specify the height and width of an image in order to speed-up the rendering process of the browser. In fact, if such information is missing, the browser needs to download the entire image before drawing it.
  • the evaluation software collects all the embedded images (e.g., IMG elements) and confirms whether they each specify height and width attributes. If at least one of those elements do not specify the height/width attributes, the software identifies the element as an attribute problem. The software then suggests fixing this problem by showing the user a dialog box in which each image is shown with its physical size, if it can be obtained from the file containing the image itself. (This is currently possible for GIF and JPEG files). The user can then input new values for the image geometry or adopt the ones that the software proposes.
  • the embedded images e.g., IMG elements
  • the rule entitled Image Stretching checks that if an embedded image is represented as a GIF file, and the IMG element specifies a height/width attribute of the image, then the value of these attributes should be equal to the physical size of the image that is extracted from the file. If the sizes are not equal, then the image is stretched for presentation on the web site. Stretching images, particularly those stored in GIF format, generally does not work and leads to pictures that are badly rendered.
  • the evaluation software collects the geometry specified in the HTML tag corresponding to the image and the geometry extracted from the file containing the image. If they differ, then a problem is generated. The same process is repeated for all the images embedded in the current page. The software then suggests that this problem be fixed by displaying a dialog box in which each image is shown along with its actual. The user can fix the images as the software proposes, or instead may abort the fixing process.
  • the evaluation software collects the IMG elements from the page that are contained within ⁇ A> elements. A problem with these elements is identified to the user who may manually fix the link data.
  • the rule entitled Background Consistency is a pairwise rule that determines whether a model page and the current page use the same background image. Changing backgrounds between pages may confuse visitors, who may think that they have inadvertently left the site.
  • the evaluation software first determines whether the two pages specify the same background image. If one does and the other does not, then a problem is identified. A problem is also identified if they both specify different background images, which may be determined from file names and sizes associated with the background. The software may then present a dialog box that presents the background images that it has found, and gives the user an option to change the background image of the current page.
  • the rule entitled Description Analysis determines whether the current page contains a description that is suitable to be shown by search engines when they present he page in their hit list. In order to be properly handled by a search engine, the description string should not exceed
  • a rule entitled Keywords Analysis detemines whether the current page contains a set of keywords that an be used by search engines when they index the page.
  • the list of keywords should be shorter than 1024 characters and should not contain HTML tags.
  • the keyword list is important for search engines when this page is indexed because it affects whether the page is retrieved and where the page is positioned in a ranked list of search results computed by the search engines.
  • the evaluation software determines whether the page contains a
  • the rule entitled Title Analysis confirms that the page has a title and that it is not longer than the maximum length allowed for a title.
  • the page title is used in bookmark lists and by search engines to display results lists. Titles that are too long are truncated. Thus, it is important that titles are brief, informative and contextual, so that the page can be bookmarked and clearly identified.
  • the evaluation software extracts the TITLE element, if any. If no such attribute is found, then a problem is generated. If it is found, then its content is analyzed to determine if the title length is longer than 64 characters. If so, a second problem is identified.
  • the software then presents a dialog box through which a user may define a new title, or change the content of the current title.
  • the user inputs the desired string and an appropriate TITLE element is inserted in the page.
  • the rule entitled Explicit Mail To relates to MAILTO: links which enable a user to send an email message by selecting a link. Accordingly, the link should contain an actual email address.
  • the evaluation software extracts all the labels of all the links having MAILTO: as protocol. It then checks if the email address specified in the URL of the link is present in the text of the ⁇ A> element containing it. If not, a problem is identified. To fix this kind of problems the user can open a dialog box in which the email address and the label of each MAILTO: link may be entered. By pressing a button the label can be changed to the address and then saved.
  • VLINK_Differs_From_LINK confirms that a color used for visited links is different than the one used for non-visited links.
  • Many users rely on the fact that links that have been already followed change their color. In this way they can tell if part of a site has already been visited.
  • the evaluation software reads the values of the attributes LINK and VLINK from the BODY element of the page, and compares them. If they are the same color (even if their name is different, like for 'BLACK' and '#000000', which are two ways to specify the same color) a problem is generated.
  • a dialog can be opened which presents the two attributes and their values, as well as a color list from which the user can choose the desired colors. The conventional colors are highlighted, so that the user can select also those that are typical for websites.
  • Non-Standard LINK determines whether the page uses a conventional color for displaying visited and non visited links, respectively blue and purple. Many users rely on the fact that followed links are purple and new links are blue.
  • the evaluation software extracts the BODY element from the page and confirms whether it contains the attributes LINK and VLINK. If it does and the two colors are different than the conventional ones, then a problem is identified. A dialog box may then be opened which enables the user to set the default colors for these attributes.
  • the evaluation software extracts from the page all the A elements and checks what they have as labels. Those having an image or those having a nonempty string are skipped. The remaining elements are then identified as problematic.
  • a dialog can be used to fix this problem.
  • the dialog box displays the list of links. The user can then type the label for each of those links and save them within the source document 118.
  • the rule entitled Consistent Navigationals is a pairwise rule which determines whether the navigationals that are used in the model page are also used in the current page.
  • Navigationals are defined as those links pointing to pages within the site and pages external to the site that are often used.
  • important links are determined by scanning all the pages of the site, collecting the frequencies with which their destination URLs are used, and selecting those most frequently used. Links connecting two frames are not considered.
  • links that are often used in the site are missing from a page, it may mean that an inconsistent navigational structure is present in the page. If this is the case, then users will have a hard time exploring the site because relevant paths are not available and paths that users have already seen in previous pages no longer exist.
  • the evaluation software examines all the links leaving the model page and those leaving the current page. It then considers only those links that are navigationals. If the remaining sets differ substantially, a problem is identified. To fix the problem the user needs to decide which of the navigational links that are missing in the current page have to be added.
  • the rule entitled Reachable Home Page determines whether the page contains a link to the site home page. If it does not, it may pose a major navigational obstacle for users who, for example, bookmark and visit an internal web page, but wish to visit the home page for the site therefrom.
  • the evaluation software collects from the site model all the pages that are connected to the current one. If the home page does not appear among such a list, then a problem is identified. To fix this kind of problem, the software prompts the user to determine where and how to include a link to the home page.
  • the rule provides a template for a textual link to the home page, but the actual fixing itself has to be performed by the user because of the determinations that have to be made.
  • the rule entitled Logical Path Rule determines which is the shortest path from the home page to the current page that a visitor can follow by clicking on links. The software then checks whether from the current page there a link to each of the intermediate pages in such a path. Users may forget the route they followed to reach this page.
  • the evaluation software scans all the links present in the page (e.g., those specified as textual ⁇ A ⁇ or embedded in images ⁇ AREA --) and checks whether they are included in the shortest path connecting the home page and the current page. If some are missing then a problem is identified. The problem can then be manually fixed by selecting the navigation structure that the software presents to the user and including it in the page. In this case, automatic fixing of the source document is not possible since adding links may require substantial reorganization of the page.
  • all the links present in the page e.g., those specified as textual ⁇ A ⁇ or embedded in images ⁇ AREA --
  • Isolated Pages is used to determine whether the current page can be reached from the home page. If isolated pages cannot be reached by users that start from the home page, then such users will not be able to view the page, even if it contains important material. The only way they would be able to view this page is via a search engine or through some bookmark. Isolated pages may occur due to spelling errors in links of other pages.
  • the evaluation software searches the site model for paths connecting the home page to the current one. If none is found (within preset time limits, which may be selected by users) then the page is declared to be unreachable and a problem is identified. To fix this problem the user needs to reorganize part of the site, which has to be done manually.
  • the rule entitled Self Referential Page is used to determine whether the page contains links that point to itself. Such links are generally useless.
  • the rule entitled Broken Links is used to determine whether links contained in the page point to resources that are valid. The presence of invalid links on the page may confuse and irritate users. Such links may be the result of a typing error in the listed URL.
  • the evaluation software collects all the links pointing to resources external to the page (including URLs to other pages and images embedded in the page). If the links point to resources that are locally stored, then these resources are checked. If they are not accessible to the software, then they are assumed to be inaccessible, and a problem is identified. If links point to resources that are external to the site (i.e. they reside on a web server that is different from the one serving the current page), then these links are probed. If this fails because of network problems then nothing is done. If probing instead fails because the resources cannot be accessed, then a further problem is identified. Fixing this problem requires checking and modifying the URL of the broken links and possibly removing the link from the page if the resource does not exist anymore. A dialog box can be presented so that a user may perform both these activities.
  • the rule entitled Click Counting is used to determine whether the number of clicks that a user needs to perform to reach the current page starting from the home page is greater than a given threshold (which may be predefined by the user). An inordinate number of clicks may deter users from reaching and reading the page. Furthermore, it may confuse novice users because they would not be able to find what they need, leading them to conclude that the site does not contain such information.
  • the evaluation software searches the path in the site model and counts the number of necessary clicks. If it exceed the threshold a problem is identified. To fix such a problem, site reorganization by the user is required.
  • the rule entitled Http Equiv Refresh Is Bad is used to determine whether the page contains HTML code that has the effect of automatically loading another page after some time has elapsed from the time when the current page was first selected by the browser.
  • Such automatic loading of a new page has a number of drawbacks: (1) it may disorient users while they are reading the content of this page; (2) it may not allow users to examine and follow links present in this page; and (3) it reduces the effectiveness of the browser's BACK button (which is the second most used features by web users, after link following).
  • the evaluation software determines whether the page has an element similar to the following one:
  • the evaluation software checks if page has an element similar to the following one:
  • the rule entitled Http Equiv Refresh Backward determines whether the current page is the destination of some automatic redirection taking place in some other page. If so, then the current page should contain the link pointing back to such a page. This is because users with a slow connection will not be given the time to read that page and perhaps follow its links. The new page (i.e. the current one) will be automatically downloaded instead, and no links to the previous page are provided to the user.
  • the rule offers a dialog showing the relevant information and enabling the user to automatically remove the META element from the other page.
  • Unvisible Image Maps is used to determine whether in the HTML code there are structures that are useless because no user would ever see them displayed by any sort of browser.
  • web designers use cut-and-paste methods to create new pages from old ones. It sometimes happens that the new page contains some 'left over' elements, like image maps (i.e. sets of links) without having an associated image. Such MAP elements do not appear in any image and are useless.
  • Unused HTML code contributes to the page size and therefore its download time.
  • the evaluation software is programmed to scan the page and searches for MAP elements that are not associated to any images in the page. If such elements are found, a problem is identified to the user 2. The software or the user 2 may fix the problem by deleting the unneeded element.
  • Imagemap Links is used to determine whether links present in image maps are duplicated as textual links. Links specified by the image map should also appear as normal textual links, for three reasons: (1) many users browse the web with "image loading" disabled; such users would not be able to follow your links; (2) many browsers are not capable of rendering all the graphics (e.g. speaking machines, microbrowsers on hand-held devices) and therefore those links would not be accessible; and (3) imagemap links are not followed by all search engines, and therefore pages they point to would not be indexed.
  • the evaluation software is programmed to scan the page and extract all the visible AREA elements and their link lists. It then checks that these links appear also within A elements elsewhere in the page. It collects those that do not appear elsewhere and identifies them as a problem. Fixing this problem requires the addition of one or more A elements with these, links. Since user input is involved in the correction, the software presents all the relevant information to the user who then may modify the page using other tools.
  • the rule entitled Show Links is a rule that displays some information about the page to the user. It does not identify usability problems but instead presents some statistics about the page. In particular it shows which are the links (of any sort) leaving the page. It displays their label (if textual) and their destination URL, and highlights local links.
  • the rule entitled Show Paths is a rule that displays some information about the page to the user. It does not identify usability problems, but instead presents some statistics about the page. In particular it shows a number (by default 50) of the shortest paths connecting the home page to the current page.
  • the rule entitled Show Site Navigationals is a site rule that displays some information about the site to the user. It does not identify usability problems but instead provides some statistics about the site. In particular it shows the list of local links that are important navigationals for the site. Links connecting two frames are not considered.
  • the rule entitled Max Clicks Number is a site rule that checks the average number of clicks on links that a user has to perform in order to reach each page. If the average number is above a predefined threshold (set in the preferences), then a problem is signaled. If the site contains many pages that, in order to be reached from the home page, require too many links to be followed, this may deter users from reaching and reading those pages. Furthermore, it may confuse novice users because they would not be able to find what they need, leading them to conclude that the site does not contain such information. In order to address these problems, the evaluation software computes the length of the shortest path from the home to every other site page. It then computes the average. It finally compares such a value with a predefined threshold.
  • the rule entitled List Of Images is a site rule that displays a list of the images used by pages of the site. It displays each image's URL and the pages that use that image. No usability problem is associated with this rule. However, it is helpful to use to track images used on the web site.
  • the rule entitled Recycle Graphics is a site rule which confirms that image files on the site can be cached by a browser. Generally, different pages pointing to the same image should refer to the same file. Images that are included in web pages, before being rendered, are cached by the user browser in order to avoid downloading them when the same image has to be displayed the next time. However, if the image is referred to by different names (i.e. different URLs), each reference is cached separately and hence has to be downloaded separately, thus increasing the download time the user has to wait.
  • the evaluation software In order to address this problem, the evaluation software generates a table which maps the name and size of every image file on a site to a list of URLs referring to those files. The table is then used to check that image files are reused instead of copied.
  • the software displays the images that are stored in duplicative files, including their size and the pages that use them. A dialog box can then be presented so that the user may replace or edit the URLs so that all requisite pages point to the same image file.
  • the rule entitled Email Consistency is a site rule that checks whether each e-mail address included in a link element has the same label throughout the site. Inconsistent labels for e-mail addresses may confuse users of the site, requiring them to spend time and effort to investigate which of the apparently different links should be followed.
  • the rule entitled PreFilteredPage is an HTML validation rule. It checks whether the values associated to elements attribute is correctly written. In particular it checks if the value requires double quotes. The rule also automatically fixes this problem.
  • Certain browsers may fail to process such pages because their HTML does not satisfy the standard (HTML 3.2 or HTML 4.0).
  • the software of the present invention scans the page and locates attribute values that are violating the above-mentioned requirement. It then may automatically add double quotes to prevent browser from accessing such commands and re-saves the source document.
  • the rule entitle Show Parser Errors is used to report the HTML errors that were found when parsing the page and those elements within the source document 118 that depend on the use of non-standard HTML. If the HTML contained in the page is not conformed to a current standard then some browsers may render the page in an unpredictable way.
  • the evaluation software compares each HTML structure used in the page to its standard definitions which is mentioned in the DOCTYPE specification in the page. If it is missing then in the preferences the user can select what standard to use. Each error is described in a concise way, as well as its location in the HTML code. To fix these kind of problems extensive re-coding may be required. The software, therefore, enables the user to open an HTML editor positioned in the error location.
  • the program scans the page searching for marquee elements. If any are found, then a problem is identified.
  • the software shows in detail the MARQUEE elements that it has found and recommends that they be disabled or deleted.
  • the rule entitled SpacerlsBad is used to determine whether the current page contains a
  • SPACER element which creates a blank space.
  • ⁇ SPACER> .. ⁇ /spacer> can only be used with NETSCAPE NAVIGATOR. It is not a standard HTML 4.0 command. Thus, a page incorporating this command may not be properly displayed by some browsers.
  • the software scans the page searching for SPACER elements. If any are found, then a problem is identified.
  • the software shows in detail the SPACER elements that it has found and proposes their replacements with alternative (but more standard) ways to achieve the same effect (eg. using ' ' or transparent gifs).
  • a dialog box can be presented to a user so that they may replace the SPACER elements with selected alternatives.
  • a user may modify both the number of usability rules to be employed as well as the weight or severity of each rule after an initial analysis, according to process 31, is performed.
  • the user may adjust both the number of rules used and the weight or severity of any number of the rules, and then reanalyze the web site, all in an iterative manner. This may be performed so as to tailor the analysis to web sites with particularized functionality.
  • the method and apparatus of the present invention is further contemplated to include a recorder feature and a downloading feature.
  • the recorder feature can be used to receive dynamic web pages (e.g. those created by a web server on the fly) that are generated in response to user inputs.
  • the recorder may copy the generated document and store it as a static file for analysis.
  • the download feature can be used to download one or more source documents based on a submitted URL.
  • the download feature may be customized to include user-definable features such as how links in the source documents are to be followed.
  • An authoring plug-in feature is also contemplated to be included in the present invention.
  • This plug-in may be used to interface with web authoring software applications to receive source document information. This feature then enables a user to fix and modify proprietary web page templates used by such wed authoring software.
  • a server log analysis feature is further contemplated to be used with the present invention.
  • the server log analysis plug-in may use server logs to decipher user activity on a web site and identify problematic attributes therefrom.
  • the server log plug-in may be adapted to accommodate any format used by a web server to monitor such user activity.
  • the server plug-in may contain a separate set of rules from those described previously which employ server log information to identify attribute problems.
  • a customized rules plug-in feature allows a user to generate and use customized rules which the user may develop to identify problems with web site attributes.
  • the method and apparatus of the present invention may be adapted to present the user with a specialized user interface for entering new rule criteria.
  • the computer language of the new rules does not have to match the language in which the predefined rules are stored.
  • This feature may further allow a user to remotely store new usability rules and to instruct a server performing an analysis where to locate the stored rules on a network or the Internet.

Abstract

A software program and computing devices incorporating the same are provided for evaluating usability features of a web site, including site navigability, download speed, and graphic quality. The software receives a group of source documents for a web site (32), parses the code to generate a web site model (36), and compares the model to a series of usability rules (38). The usability rules may be ranked or weighted according to the importance of the web site feature associated with the rule. Users may define new usability rules and adjust weights to be applied to the rules. The software program then presents a usability rating of the web site and recommends or implements fixes to the source document which will increase the usability rating of the web site (42).

Description

TITLE OF THE INVENTION:
System and Method for Evaluating and Optimizing Web Site Attributes
Cross-reference to Related Patent Applications
This application claims priority to co-pending Italian Patent Application No. UD99A00213 entitled "PROCESSO VI CONTROLLO E GESTIONE VI SITO INFORMATICO" filed on December 9, 1999 in the name of SYA srl, the entirety of which is herein incorporated by reference.
Field of the Invention
The present invention is directed generally to document processing, and more particularly to evaluating and optimizing the attributes of web site documents.
Background of the Invention
Since its introduction, the Internet has increased in popularity in terms of both the number of users accessing the Internet, as well as with the number of entities who launch and maintain web sites thereon. Web sites, as used herein, generally refers to one or more interrelated web pages. Although the computer languages for generating these web sites are largely standardized (i.e. HTML, XML, JAVA), the methods for generating them are not. As a result, attributes such as ease-of-use, appearance and navigability may vary greatly from web site to web site.
Many operators of commercial web sites generate revenue or receive revenue which in some way corresponds to the number of visitors who access their web sites. For example, advertisers may pay a web site operator to display banner ads to visitors. The price to be paid by the advertisers may be determined based on the average number of visitors to the site per day. In another example, the operator of a web site may offer to sell any number of products or services through the site. One or more web pages are presented which include pictures and or textual descriptions of the products and/or services offered. For sales to be accomplished, visitors to the web site will generally need to view these pages before purchasing the product or service. This, in turn, requires that the visitors stay at the web site long enough to view the product/service pages and make the decision to purchase the offered products and services.
Web sites which demonstrate poor ease-of-use, appearance and navigability generally do not receive as many visitors as web sites which are considered user-friendly. Such poorly designed sites also retain fewer visitors for a shorter period of time. As a result, web site operators who maintain poorly designed web sites may not be able to effectively compete on the Internet due to the lower revenue generation which results from lower visitation.
Several attempts have been made to evaluate particular features of a web site. For example, hypertext markup language (HTML) validation programs have been produced. The purpose of these programs is to ensure that the commands found in the source document of each web page conform to a specific version of HTML. In another example, link validating program have been developed which confirm that hypertext links which may placed on a web site to direct a user to a web page that is external to the current web site are valid. However, these programs do not contemplate reviewing multiple attributes of web sites. These programs further do not contemplate rating multiple attributes of a web site. Also, such programs fail to present suggested corrections for web design errors. Thus, it would be advantageous to develop a method and system for evaluating and optimizing web site attributes so as to promote increased visitation and user retention, thereby increasing revenues and or popularity of a web site.
Summary of the Invention
The present application is directed to particular features of a method and system for evaluating and optimizing web site attributes. In particular, the present invention includes a first method for evaluating a web site. A server or a software program of the present invention receives a source document corresponding to a web site from a user. The source document may have a plurality of tags. The system compares at least one of the plurality of tags to a usability rule and generates a usability rating based on the comparison. In a second embodiment of the present invention, a server or program of the present invention receives an identification of a source document corresponding to a web site and a user request to compare the source document to a usability rule. In response, the source document is compared to the usability rules and the user receives a usability rating of the web site in response to the request. In a third embodiment of the present invention a method and accompanying apparatus for for increasing a usability of a web site begins when a user identifies a web site having a plurality of attributes and sends a request for a comparison of the web site to a plurality of usability rules. A server or computer then presents an identification of a problem with at least one of the attributes in response to the request.
Brief Description of the Drawings
Further aspects of the present invention will be more readily appreciated upon review of the detailed description of the preferred embodiments included below when taken in conjunction with the accompanying drawings, of which:
FIG. 1 is a block diagram illustrating an exemplary computer network in accordance with an embodiment of the invention;
FIG. 2 is a block diagram of exemplary components of a server for use with the system of FIG. 1;
FIGS. 3 and 4 are a flowchart of an exemplary comparison process in accordance with an embodiment of the present invention;
FIGS. 5 and 6 are exemplary screen displays depicting selectable usability rules and variable weights according to one embodiment of the present invention;
FIG. 7 is an exemplary screen display of a menu for initiating a web site attribute evaluation according to one embodiment of the present invention; FIG. 8 is an exemplary screen display depicting an entry field for designating a source document to review according to one embodiment of the present invention;
FIG. 9 is an exemplary screen display of a source document to be evaluated according to one embodiment of the present invention; FIG. 10 is an exemplary screen display showing the result of a web site evaluation according to one embodiment of the present invention;
FIG. 11 is an exemplary screen display showing an explanation of an identified problem with a web site according to one embodiment of the present invention;
FIG. 12 is an exemplary screen display showing a description and suggested correction of an identified problem with a web site according to one embodiment of the present invention; and
FIG. 13 is an exemplary usability rating generated according to one embodiment of the present invention.
Detailed Description of the Invention
Referring now to FIGS. 1-13, wherein similar components of the present invention are referenced in like manner, preferred embodiments of a method and system for evaluating and optimizing web site attributes are disclosed.
Turning now to FIG. 1 , there is depicted an exemplary computer network 1 through which a plurality of remote devices 2, 4, 6 may communicate with a server 16 via network connection 10 in any known manner. Although computer network 1 is preferably an Internet- based network such as the World Wide Web, it may be any one or more of a local area network (LAN), a wide-area network (WAN), an intranet environment, an extranet environment, a wireless network or any other type of computer network, such as those enabled over public switched telephone networks.
Remote terminals 2, 4, 6 may each be any type of computing device, such as a personal computer, a workstation, a network terminal, a hand-held remote access device or any other device that can communicate with the server 16 over the network connection 10. Users may interact with the server 16 in order to submit source document information and the like and receive a usability rating for a web site corresponding to the source document. Server 16, in turn, may include any number of computer servers operative to evaluate source document information in accordance with the present invention. Specific functions and operations of serverlό are discussed further below.
Turning now to FIG. 2, displayed therein are exemplary components of a computing device, such as server 16. It should be understood that any of customer terminals 2, A and 6 may share similar configurations. However, for sake of brevity, the discussion immediately below will refer to the server 16 only.
The primary component of the server 16 is a processor 20, which may be any commonly available microprocessor, such as the PENTIUM III manufactured by INTEL CORP. The processor 20 may be operatively connected to further exemplary components, such as RAM/ROM 26, a clock 28, input/output devices 30, and a memory 22 which, in turn, stores one or more computer programs 24.
The processor 20 operates in conjunction with random access memory and read-only memory in a manner well known in the art. The random-access memory (RAM) portion of RAM/ROM 26 may be a suitable number of Single In-line Memory Module (SIMM) chips having a storage capacity (typically measured in kilobytes or megabytes) sufficient to store and transfer, inter alia, processing instructions utilized by the processor 20 which may be received from the application programs 24. The read-only memory (ROM) portion of RAM/ROM 26 may be any permanent non-rewritable memory medium capable of storing and transferring, inter alia, processing instructions performed by the processor 20 during a start-up routine of the server 12.
The clock 28 may be an on-board component of the processor 20 which dictates a clock speed (typically measured in MHz) at which the processor 20 performs and synchronizes, inter alia, communication between the internal components of the server 12.
The input/output device(s) 30 may be one or more commonly known devices used for receiving operator inputs, network data, and the like and transmitting outputs resulting therefrom. Accordingly, exemplary input devices may include a keyboard, a mouse, a voice recognition unit and the like for receiving operator inputs. Output devices may include any commonly known devices used to present data to an operator of the server 12 or to transmit data over the computer network 1 to a remote user or customer. Accordingly, suitable output devices may include a display, a printer and a voice synthesizer connected to a speaker.
Other input/output devices 30 may include a telephonic or network connection device, such as a telephone modem, a cable modem, a T-l connection, a digital subscriber line or a network card, for communicating data to and from other computer devices over the computer network 1 , such as remote terminal 2. In an embodiment involving a network server, it is preferred that the communications devices used as input/output devices 30 have capacity to handle high bandwidth traffic in order to accommodate communications with a large number of users. The memory 22 may be an internal or external large capacity device for storing computer processing instructions, computer-readable data, and the like. The storage capacity of the memory 22 is typically measured in megabytes or gigabytes. Accordingly, the memory 22 may be one or more of the following: a floppy disk in conjunction with a floppy disk drive, a hard disk drive, a CD-ROM disk and reader/writer, a DVD disk and reader/writer, a ZIP disk and a ZIP drive of the type manufactured by IOMEGA CORP., and/or any other computer readable medium that may be encoded with processing instructions in a read-only or read-write format. Further functions of and available devices for memory 22 will be apparent. The memory 22 preferably stores, inter alia, a plurality of programs 24 which may be any one or more of an operating system such as WINDOWS 2000 by MICROSOFT CORP, and one or more application programs, such as a web hosting program, which may be necessary to implement the embodiments of the present invention. The programs 24 preferably include evaluation software for evaluating and optimizing web site attributes in accordance with the present invention.
Turning now to FIGS. 3 and 4, with continuing references to FIGS. 5-13, an exemplary process 31 for evaluating and optimizing web site attributes is presented. The process 31 begins when a user identifies a web site or presents one or more source documents to be evaluated (step 32). This may be accomplished by initiating evaluation software which implements the process 31. The evaluation software may present a screen display 114 in which a dialog box 112 is presented for entering an identification of the web site or files containing source document data to be evaluated, as depicted in FIG. 8. The source document 118 may be written in any suitable format or language, such as HTML, XML and CSS. The corresponding web site may be an e- commerce web site, or any other type of web site which has user-accessible content presented in one or more inter-related source documents 118.
In an online embodiment of the present invention, a user 2 may transmit a URL or other identifier to a remote server 16 which, in turn, downloads source data from a second server (not shown) that hosts the web site corresponding to the URL. Alternatively, the user 2 may transmit source document data 118 (as shown in screen display 116 of FIG. 9) corresponding to the web site directly to the server 16 from one or more computer-readable files stored in a memory of the user terminal 2.
In an offline embodiment, a user 2 may have the evaluation software installed on the server hosting the web site (not shown), in which case network communication over network connection 10 is not necessary. Also, the user 2 may install the evaluation software on a standalone computer and load the source document data from a computer-readable medium.
Next the user transmits a request to compare the source documents corresponding to the web site to multiple and various usability rules stored by the program (step 34). A screen display 100 of exemplary rules are provided in FIGS. 5 and 6. The rules may be presented according to a number of headings 102, such as page rules, graphics rules, navigation rules and structure rules. Other headings or fewer headings 102 may be provided. The screen display may also present individual rule titles 104 which generally describe the rule to be applied. Each rule may further have a weight 106 associated therewith. The weight 106 is used to generate a usability rating as described further below. The weights may be adjusted by selecting the weight 106 and inputting a new number variable as displayed according to checkbox 106 of FIG. 6. Preferably, weights are to be applied on a scale of one to five. However, other weight systems may be used. A number of checkboxes 105 may be presented by which a user may designate whether particular rules or groups of rules are to be applied. It is further contemplated that a user 2 may define and incorporate their own rules into the evaluation software.
In a preferred embodiment, the rules to be employed as well as the statistical weights to be applied thereto are based on user surveys in which participants are asked to rate various existing web sites according to ease-of-use, navigability and appearance. An analysis of the questionnaires may provide a guideline to identifying and quantifying those attributes which negatively impact a user's perception of the web site. Other criteria for determining problems, statistical weights, and the like may be used.
Returning to process 31, in response to the user's request, the processing instructions of the evaluation software next direct the processor 20 of the server 16 to parse each received source document 118 to generate a corresponding tag list, which includes the document tags stored in each source document 118 (step 36). The tags may be written in a format that corresponds to standard web document languages such as HTML or XML. As used herein, the parsing of a source document may involve transforming a stream of computer-readable data (e.g., representing HTML code) into a useable data structure which is then stored in an accessible memory, such as memory 32 or the RAM component of RAM/ROM 26. Such a data structure preferably represents the content of the page and is optimized for access speed. The evaluation program may then query the data structure to determine what web site attributes are present in the source document 118. Next, the evaluation software generates a model of the entire web site based on the source documents received (step 38). According to one embodiment of the present invention, once the tag list for each page has been assembled, the evaluation software constructs a model of the web site which includes, for example, the interconnections among the various pages of the web site. The site model designates which page corresponds to the home page of the web site and further identifies which is the current web page chosen for the evaluation. In one embodiment, the site model is a graphic depiction representing all possible connections between any two pairs of pages in the web site. The connections are preferably determined from HTML links presented in the source documents as well as anchors, included images, frames, automatic refresh pages, background images, image map links, and the like included in the web site. The site model particularly helps determine problems with respect to site rules and the like as described further below.
The evaluation software may then direct the processor 20 to compare the attributes from the tag list corresponding to a source document 118 to the stored usability rules (step 40). Each usability rule is preferably a set of processing instructions that are applied to each page systematically and which may be applied in any order. Such processing instructions direct the processor 20 to collect needed information from the tag list corresponding to the source document 118, and from the site model. There may be three types of rules: (1) those applied to individual pages and analyzing features of those particular pages (referred to herein as single page rules); (2) those that compare a page with a model page or another page of the web site (referred to herein as pairwise rules); and (3) rules which check features that do not belong to specific web pages, but rather, are properties of the entire site (referred to herein as site rules). The pairwise rules are generally used to check the consistency between a page chosen as a model and each other page that should conform to the former one. Checking is performed by extracting several features of the model page and verifying that the remaining pages have them as well.
Next, the processing instructions may direct the processor 20 to compare tag list data to a specific standard and if the standard is not met, a usability problem is generated and identified to the user (step 42). According to the degree of conformance of such a data with respect to the rule and the weight identified with the rule, the problem is assigned to a severity level. In accordance with this step, a screen display 120 reporting detected web site attribute problems is depicted in FIG. 10. According to one embodiment of the present invention, the evaluation software may present a user 2 with a list of pages 124 from the identified web site 122. A first column 126 presents the number of detected problems associated with each page and the value 128 representing the average number of problems per page. A second column 130 may present a severity rating associated with each page which may be determined based upon the number of problems detected and the weights of the detected problems. The severity may also be represented in a graphical format. The second column may also include the average severity of the web site 132, which may in turn, be used to generate a usability rating for the web site, in any useful manner. A third column 134 may further be presented which lists the number of problems fixed and the number of problems originally present.
Next, the user 2 may indicate whether problems identified by the evaluation software are to be corrected (step 44). If so, the process 31 continues to step 46 below. Otherwise the process continues step 54, also described below. In order to fix identified problems, the user 2 may select a page listed in column 124 of screen display 120. The evaluation software may then present a dialog box, such as dialog box 136 of FIG. 11, which lists the individual problems associated with the selected page. The dialog box 136 may further contain a window 138 for presenting a description 142, an explanation 139 and one or more suggested corrections 144 for the one or more listed problems.
The user 2 may then decide if one or more of the suggested corrections should be implemented (step 46). If the user 2 decides to use the suggested corrections, the user 2 may activate a fix problem command 145, after which the evaluation software inserts the suggested corrections into the applicable source document 118 (step 50). If, however, the user 2 wishes to correct the attributes manually, the user 2 may instead select the edit page command 146 and insert desired replacement code in the source document 118 (step 48). Steps 44-50 may be repeated iteratively, as needed, for each problem identified with the subject web site. Next, the user 2 may decide if the web site is to be re-evaluated (step 52). if so, the process 31 returns to step 40 above. Otherwise, the process 31 continues to step 54 wherein the evaluation software generates a usability rating of the web site. The usability rating is determined in any useful manner by considering the number of problems associated with the web site, the weights associated with those problems, and a percentage of the web site for which there are no demonstrable problems. Other factors may be used to generate the usability rating. The usability rating may take the form of a percentage, but may alternately be described in terms of a number scale, a letter grade or any other useful indicium of web site ease-of-use, navigability, appearance, and/or user-friendliness. In one embodiment, the usability rating may be generated by the number of pages having problems with a high weight or severity in comparison to the number of pages of the web site. In another embodiment, usability ratings may be assigned in the following manner. A rating of 100% would be assigned to a web site which does not demonstrate catastrophic problems (i.e. those problems having a highest weight), does not demonstrate major problems (those having an intermediate weight) and has less than a predetermined number of minor problems per page(those with the lowest weights). A web site would receive a usability rating of 80% when there are no catastrophic problems, less than a predetermined number (e.g. 3) of major problems per page, and less than a predetermined number (e.g. 10) of minor problems per page. A usability rating of 60% would be applied a web site which contains no catastrophic problems, less than a predetermined number (e.g. 10) of major problems per page, and less than a predetermined number (e.g. 10) of minor problems per page. A 40% rating would be applied to web sites which do not demonstrate catastrophic problems, but demonstrate higher than predetermined numbers of major and minor problems. A 0% rating would be applied to a web site demonstrating catastrophic problems. Other rating systems may be used.
Once generated, the user 2 may decide whether to place the rating on the web site (step 56). This may be accomplished by inserting appropriate web page formatting commands into a source document of the web site. An example of a displayed usability rating 160 is presented in FIG. 13. If the user 2 decides to insert the rating, the process 31 continues to step 58 where the appropriate instructions are inserted into the source document 118, after which the process 31 ends. If, on the other hand, the user 2 decides not to insert the rating, the process 31 ends without performing step 58 The following are rule titles corresponding to rules which may be employed by a method or apparatus of the present invention. Additional rules may be used and fewer rules may be employed. Preferred rules, however, include: (1) Body Color Portability, (2) Portable Font Color, (3) Known Body Color, (4) Known Font Color, (5) No Blinking, (6) Headings Exist, (7) Body Consistency, (8) Body Color Consistency, (9) Consistent Link Colors, (10) Noframes Exists, (11) Noframes With Content, ( 12) Noframes Links, (13) IFRAME Exists, ( 14) Frame Borders, (15) Frame Margin, (16) Frame Targetting, (17) Noresize Frame, (18) Frames Are Bad, (19) Page Size Small, (20) IMG with ALT, (21) Img Alt Short, (22) ALT With Content, (23) IMG With Size, (24) Image Stretching, (25) Forced Download Images, (26) Background Consistency, (27) Description Analysis, (28) Keywords Analysis, (29) Title Analysis, (30) Explicit Mail To, (31 ) VLINK_differs_from_LINK, (32) Non-Standard LINK, (33) Empty Anchor Links, (34) Consistent Navigationals, (35) Reachable Home Page, (36) Logical Path Rule, (37) Isolated Pages, (38) Self Referential Page, (39) Broken Links, (40) Click Counting, (41) Http Equiv Refresh Is Bad, (42) Http Equiv Refresh Forward, (43) Http Equiv Refresh Backward, (44) Invisible Image Maps, (45) Image Map Links, (46) Show Links, (47) Show Paths, (48) Show Site Navigationals, (49) Max Click Number, (50) List Of Images, (51) Recycle Graphics, (52) Email Consistency, (53) Pre-Filtered Page, (54) Show Parser Errors, (55) Marquee Is Bad, (56) Spacer Is Bad. Each of attribute problem, a description of the problem, and suggested fixes for the problem are presented in detail below.
The rule relating to Body Color Portability checks that colors specified in a BODY element of the source document for foreground or background are recognizable by various versions of standard web browsers. Such colors may be those belonging to a predefined list of color names or those specified as RGB values belonging to a specific subset. Using non-standard colors may lead to different renderings between different browsers on displays not supporting 24-bit colors (i.e. millions of colors), thus reducing the graphical quality of the site. To correct a detected problem with this attribute, the evaluation software first extracts the colors specified in the BODY HTML elements of the source document 118 and compares them with the predefined list of acceptable colors or to standard RGB formats. The software then signals a problem for each page that uses colors not satisfying these constraints. Possible fixes for this problem include presenting an interactive tool to choose an acceptable color to substitute for the original, problematic color listed.
The rule relating to Portable Font Color checks that colors specified in <FONT COLOR=...> tags of the source document 1 18 are portable across standard web browsers.
Portable colors are those belonging to a predefined list of color names or those specified as RGB values belonging to a standard format. Using non-standard colors may lead to differing presentations of a web site among different types of web browsers, especially those which do not support 24-bit colors. This, in turn, may reduce the graphical quality of the web site. To identify corrections to this problem, the rule extracts from the source document those colors specified as attributes of FONT elements, generally having the form <FONT COLOR=...>. Next, the evaluation software compares these colors with a predefined list of standard colors and/or the standard RGB formats. The software signals a problem for each page that contains FONT elements with colors not satisfying those constraints. The software may then suggest a fix for this problem by presenting an interactive tool to choose an acceptable color to substitute for the original color.
The rule relating to known body color checks that colors specified in the BODY element of a source document are specified either as RGB strings or as symbolic names for VGA standard colors only. The list of VGA colors is constituted by 16 color names that all browsers understand. Using symbolic names for colors, except for recognized VGA colors, may lead to different renderings between different browsers, and some may not even be recognized, thus reducing the graphical quality of the site. In order to correct this problem, the evaluation software extracts the colors specified as attributes of the BODY HTML element in the page and compares them with the predefined list and the VGA standard. The software then identifies a problem for each page that uses colors not satisfying these constraints. The software may then suggests a fix for the problem by presenting an interactive tool to choose an acceptable to substitute for the original color. The rule relating to Known Font Color checks that colors specified in a <FONT
COLOR=...> tag are specified either as RGB strings or symbolic names for VGA colors only. As stated previously, the list of VGA colors is constituted by 16 color names that all browsers understand. Using symbolic names for colors, except for the VGA ones, may lead to different renderings between different browsers. Some names may not even be recognized, thus reducing the graphical quality of the site. In order to fix this problem, the evaluation software extracts from the source document 118 those colors specified as attributes of FONT elements in the page which have the form <FONT COLOR=...>. Next, the software compares these colors with the predefined list and the RGB format. The software then identifies a problem for each page that contains FONT elements with colors not satisfying those constraints. The software may suggest a fix to the problem by presenting an interactive tool from which the user may choose an acceptable color to substitute for the original one.
The rule relating to No Blinking checks that the page does not contain the BLINK element of an HTML document. The BLINK element, though being part of the HTML 4.0 standard, is considered an undesirable element because surveys have indicated that blinking animated features on a computer screen have a strong negative impact on a user's peripheral visual perception. In other words, a user's attention is constantly distracted away from the interesting parts of the page. It has been further shown that distracted users will move quickly to other sites, thus decreasing visitor retention times.
In order to address this, the evaluation software extracts from the page all the BLINK elements and generates a problem for all of them. In addition, the software suggests to fix the problem by substituting the BLINK element with other means to emphasize text, including, for example, inserting a new color, increasing font size, using boldface (e.g. by employing a <STRONG> tag), or by using italics (e.g. by employing an <EM> tag).
The rule relating to Headings Exist checks that if the page contains a reasonable amount of text (pages consisting mainly of graphics may be skipped), then the text should be split and heading elements should be inserted. Headings <H1>, <H2>, <H3> are used in html pages to structure text into blocks. The advantages of using headings are: (1) the text may be easier to scan and read (2) the text may be more effectively rendered by speaking machines, and (3) the heading text may used when a search engine indexes the page.
In order to fix problems of this nature, the evaluation software first counts the words that constitute the content of a page. If the resulting number is above a predefined threshold, then it extracts all the heading elements. If none are found then a problem is generated. The software then suggests to fix the error by splitting the page content into sections and using heading elements to specify sections titles.
The rule relating to Body Consistency is a pairwise rule that checks that the subject page contains a particular body element if such element is found in the model page. If this is not true, then the two pages have a different structure that breaks consistency. Changing structure between pages may confuse readers to the point that they may not know if they are still visiting the same site, or if they got the page they requested. In order to address this problem, the evaluation software extracts BODY elements from the two pages and compares results. If the compared elements are different then a problem is generated. The software then suggests that the problem be fixed by making the subject web page use exactly the same structure as that used by the model page.
The rule relating to Body Color Consistency is a pairwise rule that checks that a model page, such as a home page of the web site, and the current page specify the same color for the following components: background, foreground text, visited links, unvisited links and active links. Changing colors between pages may confuse readers to the point that they may not know if they are still visiting the same site, if they got the page they wanted, or if the meaning of colored objects has changed. In order to address this problem, the evaluation software extracts from the model and the current page the color specified for the considered components. If they differ (or only one of the two pages specify it) then a problem is generated. The evaluation software then suggests that the user replace, add or remove the color used in the current page, accordingly. This may be accomplished through a suitable dialog box that presents the information extracted from the model and offers the means to copy the colors to the current page. The rule relating to Consistent Link Colors is a pairwise rule that checks if individual instances of links have colors that are inconsistent between the model and the current pages. Inconsistent use of link colors is a source of confusion for users because they may not easily realize that words printed in a new color link to the same type as those that they have previously seen. In order to address this problem, the rule extracts FONT elements from both pages and the colors specified in those FONT elements. If the two pages differ for the same link type, then the evaluation software signals a problem. The problem may be fixed through a dialog box that shows which elements in the model page use which color and that highlights those elements in the current page using a color not appearing in the model. The rule relating to No Frames Exists checks that if the page contains a FRAMESET element (that is, an element specifying that the page should have a frame-based structure), then the page should also contain a NOFRAMES element. This is because a frameset without <NOFRAMES> ... < NOFRAMES> will not be displayed by certain browsers which can not render frames. Such browsers include early versions of well-known browsers as well as certain modern specialized browsers, like speaking machines, browsers for PDAs, and text-only browsers. Thus, by failing to specify a NOFRAMES element, many users will not be able to access such a page and might decide to give up exploring the site. In order to address this problem, the evaluation software determines whether the source e document 118 includes a FRAMSET and, if so, whether a NOFRAMES element exists. If it does not exist, then a problem is generated. The software then suggests to fix the problem by adding a NOFRAMES element. A suggestion presented for inserting such a NOFRAMES element may be similar to the following:
Consider adding the following HTML code after </FRAMESET>: <NOFRAMES>
<BODY bgcolor=RGBCOLOR text=RGBCOLOR background=IMAGE>.... </BODY>
< NOFRAMES>
The rule relating to No Frames With Content checks that if the page contains a NOFRAMES element, then it should contain meaningful text that is to be substituted for the frame, rather than sentences like 'you need a frame enabled browser to view this page'. The NOFRAMES element is intended to be used to provide users of browsers that are not frame- enabled the means to access the site, although with a slightly degraded graphic and interaction quality. A NOFRAMES element containing text stating that the page cannot be rendered by the browser being used does not help the user to access the page content that is framed.
In order to address this problem, the evaluation software collects all the text included in the NOFRAMES element, extracts the words that appear more frequently and removes from the resulting list of words other words that are not significant (like 'browser', 'frame', 'download', etc.). If less than a predefined proportion (e.g. 50%) of the words remain, then a problem is generated since too few significant words have been found. Next, the rule extracts the most frequent words contained in the pages that constitute the FRAMESET components of the current page and shows them to the user. The user can then read those words and perhaps use them to fill the content of the NOFRAMES element.
The rule entitled No Frames Links checks that if a page that contains a FRAMESET, and further contains a NOFRAMES element, then the NOFRAMES element should include many of the links contained in the framed pages. This is because the NOFRAMES element is intended to provide users of browsers that are not frame-enabled the means to access the site anyway, although perhaps with a slightly degraded graphic and interaction quality. A NOFRAMES element that does not contain links, will not permit navigation from the page, thus unintentionally preventing a user from exiting the page without deactivating their browser.
In order to address this problem, the evaluation software extracts two lists of links: those connecting the NOFRAMES element with other pages and those connecting the framed pages with other pages. It then compares the lists and if less than, for example, 70% of the links found in framed pages occur in the NOFRAMES element, then a problem is generated. Since it is necessary to determine where in the web page to place the missing links, the user must generally provide a manual fix to this problem.
The rule entitled IFRAME Exists checks if an IFRAME element is present in the page. Though included in the HTML 4.0 standard, the IFRAME element (referring to "inline frame") is not supported by NETSCAPE NAVIGATOR versions 2, 3, or 4.1. Nor is it supported by INTERNET EXPLORER verson 2. Thus, users who surf the net with such browsers may not be able to correctly see the page. In order to correct this problem, the evaluation software extracts any IFRAME elements from the page. If some are found, then a problem is generated. Since there is no easy way to simulate the effect of IFRAME using other standard HTML elements, the software suggests that the user fix the problem by either reorganizing the page without using IFRAME or providing an alternative route for users not using INTERNET EXPLORER 3+.
The rule entitled Frame Borders checks if the page specifies frame borders in a way that is incompatible between the two major browsers in the market: NETSCAPE NAVIGATOR and INTERNET EXPLORER. This is because these browsers use different HTML coding for determining whether frame borders should be drawn, and the width of the frame. A FRAMESET or FRAME element that uses only one type of coding will be properly viewed only on one type of browser. Thus, users surfing with other than that browser will not receive a properly rendered page. For FRAMESET, NETSCAPE NAVIGATOR 3+ and INTERNET EXPLORER 4+ use the format: FRAMEBORDER=no|yes
INTERNET EXPLORER 3+ uses the format: FRAMEBORDER=0| 1
FRAMESPACING=number
For FRAME, NETSCAPE NAVIGATOR 3+ and INTERNET EXPLORER 4+ use the format:
FRAMEBORDER=no|yes INTERNET EXPLORER 3+ uses the format: FRAMEBORDER=0|1 In order to correct this problem, the evaluation software checks what combination of HTML code is used to specify frame borders. In particular it checks if apparently redundant attributes are used that can be understood by both major browsers. The code that guarantees that both browser will work is displayed below:
Within FRAMESET or FRAME:
FRAMEBORDER=no co-occurs with FRAMEBORDER=0 FRAMEBORDER=yes co-occurs with FRAMEBORDER=l
Within FRAMESET the following attributes and values should be used:
FRAMEBORDER=no and FRAMEBORDER=0 and BORDER=0 and
FRAMESPACING=0
The evaluation software may offer a dialog box showing what kind of attributes are used in the page and highlighting what other attributes should be added to guarantee full compatibility with current major browsers. The dialog box also provides the user with the ability to automatically add the missing attributes.
The rule entitled Frame Margin determines whether the page specifies frame margins in a way that is compatible with major browsers in the market, for example, MARGINHEIGHT and MARGINWIDTH can be used to specify the size of top/bottom and left/right margins (e.g., the distance between text and border of window). Unfortunately, certain browsers do not allow a margin=0. If any of the two attributes are set to 0, then a software error occurs with these browsers In order to address this problem, the evaluation software extracts from the page the attributes concerned with frame margins. It checks if MARGINWIDTH and/or MARGINHEIGHT are used with a value equal to 0. The software then shows the FRAME elements with such margins and suggests that the user remove the margin specification and instead use an appropriate background color for frames which will, in turn, camouflage the border when shown by non-supported browsers.
The rule entitled Frame Targeting concerns a problem where pages linked from within a frame may be displayed in different windows. The TARGET tag in HTML is one attribute controlling such an effect. A common design mistake is to forget to specify the target for pages that should not be framed. A user following such links will be trapped inside a frame until a new browser window is explicitly opened. Another mistake is to use TARGET=_blank, which will tell the browser to open a new window each time the user clicks on the link, quickly clogging the user's desktop.
In order to address these problems, the evaluation software collects all the links leaving the page, if the page is rendered within a frameset. It then removes from such a list those links pointing towards other framed pages belonging to the same frameset. It further removes those links that are not important. (Important links are those pointing to pages internally and to pages external to the site that are often used within the site. In practice, important links are determined by scanning all the pages of the site, collecting the frequencies with which their destination URLs are used, and selecting those most frequently used. Links connecting two frames are generally not considered.) Next, the rule extracts the TARGET attribute for such links. If the
TARGET is '_self or is missing, then a problem is generated (because the link destination will be rendered by browsers within the current frame). If it is '_blank' then another kind of problem is generated. That is, a new browser window will be opened for rendering the link destination. In order to fix this problem, the evaluation software opens a dialog box which shows the list of links having wrong targets suggesting a correct target.
The rule entitled Noresize Frame confirms that the FRAME elements of the subject source document 118 do not contain the attribute NORESIZE. This is because users may want to move frames borders to better render web site content within their browser. Often this is needed because the configuration of their browser (e.g., font size, image downloading, etc.) may lead to an inconvenient layout of the frame. By specifying NORESIZE, the width or height of a frame is frozen, thus reducing user flexibility.
In order to address this problem, the evaluation software collects the FRAME elements contained in the page and checks if the NORESIZE attribute is set. If so, it identifies the problem to the user. A dialog box is then opened which shows the non-resizable frames and offers to automatically remove that attribute.
The rule entitled Frames Are Bad determines whether frames are present in a page and if so simply gives a warning message to the user. Frames suffer from a number of usability drawbacks, such as: (1) framed pages cannot be bookmarked ; (2) users often get confused as to which frame is active; (3) frames cannot be rendered by specialised browsers, like text-based or speaking machines; and (4) framed page are often not indexed by search engines, as they have no recognizable content. The rule entitled Page Size Small determines whether the page can be quickly downloaded by visitors. Many visitors use slow internet connections and will be inconvenienced if the download time for a page is too long.
In order to address this problem, the evaluation software determines the size of all the files involved in the page (e.g., background and other images, framed pages, etc.), computes a sum and then computes the estimated download time for website users that may use different modem speeds (e.g. 14.4, 28.8, 33.6, 56.0 kbps). Special attention is paid if the page being analyzed is a home page of the web site, since this will be the first page a visitor is presented with. If a download time for the page exceeds a predetermined or user-selected threshhold, the evaluation software recommends a reduction in the size of the page. The software may further highlight the size of particular page components, such as images, in order to assist the user in limiting the page size.
The rule entitled IMG With ALT determines whether images embedded in a page contain an alternative textual description. Images are not always downloaded by users and rendered by browsers. Image downloading may be disabled in many web browsers in order to increase an access to a web site. Thus, providing an alternative textual description of the image with the ALT attribute helps these users to determine what kind of image they missed and might be interested in. Furthermore, ALT descriptions are usually downloaded and rendered immediately by browsers, whereas the image itself might require several seconds to download and render. Finally, non graphically capable browsers (speaking machines or text-based browsers), are still able to provide some information to users. In order to address this problem, the evaluation software extracts all the IMG elements from a page and then checks if they specify an ALT attribute. If not, a problem is generated and the software provides a recommendation that an ALT string with suitable text be inserted. The rule entitled IMG Alt Short determines whether images embedded in page have an alternative textual description that is not too long. Long strings will typically be truncated by browsers or won't be rendered at all. Therefore, long ALT strings contribute negatively to the download time of a web page. The absolute limit of an ALT string is 1024 characters, but it is reasonable for a web designer to adopt a much smaller limit.
In order to address the problem, the evaluation software presents a dialog in which the ALT string of each collected image is shown, together with the image itself, and the user can edit or reword the string so that it becomes shorter. Then the new string can be saved attached to its image in the current page or in all the occurrences of that image in other pages of the site.
The rule entitled ALT With Content determines whether the alternative string associated with an image does not consist of words that simply suggest the user to use a graphic-enabled browser. The ALT attribute should be used to convey some information about the content of the associated graphics. In this manner, non-graphical browsers (e.g., text-based, speaking machines, browsers with image download disabled and browsers for blind people) can still provide some meaningful information about the graphic image that can not be presented. Furthermore, search engines can better index the page. In order to address this problem, these strings are extracted from the page and words belonging to a specific subset of generic terms (e.g., 'browser', 'graphic', 'enable') are removed. The number of remaining words are compared to the original size of the string, and if the ratio is below a predefined threshold, then a problem is signaled. The software then presents a dialog box in which the ALT string of each collected image is shown, together with the image itself. The user can then edit or re-word the text so that it becomes shorter. Then the new string can be saved attached to its image in the current page, as well as in all the occurrences of that image in other pages of the sites. The rule entitled IMG With Size determines whether images that are embedded in the current page declare their sizes in HTML. It is important to specify the height and width of an image in order to speed-up the rendering process of the browser. In fact, if such information is missing, the browser needs to download the entire image before drawing it. This, in turn, wastes time by increasing the time a user needs to download the page. In order to address this problem, the evaluation software collects all the embedded images (e.g., IMG elements) and confirms whether they each specify height and width attributes. If at least one of those elements do not specify the height/width attributes, the software identifies the element as an attribute problem. The software then suggests fixing this problem by showing the user a dialog box in which each image is shown with its physical size, if it can be obtained from the file containing the image itself. (This is currently possible for GIF and JPEG files). The user can then input new values for the image geometry or adopt the ones that the software proposes.
The rule entitled Image Stretching checks that if an embedded image is represented as a GIF file, and the IMG element specifies a height/width attribute of the image, then the value of these attributes should be equal to the physical size of the image that is extracted from the file. If the sizes are not equal, then the image is stretched for presentation on the web site. Stretching images, particularly those stored in GIF format, generally does not work and leads to pictures that are badly rendered. In order to address this problem, the evaluation software collects the geometry specified in the HTML tag corresponding to the image and the geometry extracted from the file containing the image. If they differ, then a problem is generated. The same process is repeated for all the images embedded in the current page. The software then suggests that this problem be fixed by displaying a dialog box in which each image is shown along with its actual. The user can fix the images as the software proposes, or instead may abort the fixing process.
The rule entitled Forced Download Images determines whether the page contains images embedded in links that need to be downloaded in order to follow the link, even if the user disabled automatic downloading of images. Images whose downloading cannot be disabled by end users are those within <A> ... </A>. Enabling or disabling downloading of images can make the difference between a usable site and a site that cannot be experienced without very long waiting times. Embedding <IMG...> elements within the label of an anchor <A href=...> labelled with <IMG src=...> </A> forces the downloading of the image even when the user doesn't want to download it. Furthermore, embedding images in anchors is acceptable only if you can safely assume that downloading them will not require long waiting times for the user. In order to address this problem, the evaluation software collects the IMG elements from the page that are contained within <A> elements. A problem with these elements is identified to the user who may manually fix the link data.
The rule entitled Background Consistency is a pairwise rule that determines whether a model page and the current page use the same background image. Changing backgrounds between pages may confuse visitors, who may think that they have inadvertently left the site.
In order to address this problem, the evaluation software first determines whether the two pages specify the same background image. If one does and the other does not, then a problem is identified. A problem is also identified if they both specify different background images, which may be determined from file names and sizes associated with the background. The software may then present a dialog box that presents the background images that it has found, and gives the user an option to change the background image of the current page. The rule entitled Description Analysis determines whether the current page contains a description that is suitable to be shown by search engines when they present he page in their hit list. In order to be properly handled by a search engine, the description string should not exceed
1024 characters and should not contain HTML tags.
For this attribute, the evaluation software determines whether the page contains a <META name=description content- '..." > element. If it does not, then a problem is identified and the software suggest that a meta tag be formed and placed in the source document.
Similar to the description analysis, a rule entitled Keywords Analysis detemines whether the current page contains a set of keywords that an be used by search engines when they index the page. The list of keywords should be shorter than 1024 characters and should not contain HTML tags. The keyword list is important for search engines when this page is indexed because it affects whether the page is retrieved and where the page is positioned in a ranked list of search results computed by the search engines.
In order to address this, the evaluation software determines whether the page contains a
<META name=keywords content- '..." > element. If not, a problem is identified. The software then suggests that the user provides a list of keywords. Additionally, the software may generate a suitable list by automatically extracting and ranking text from the page.
The rule entitled Title Analysis confirms that the page has a title and that it is not longer than the maximum length allowed for a title. The page title is used in bookmark lists and by search engines to display results lists. Titles that are too long are truncated. Thus, it is important that titles are brief, informative and contextual, so that the page can be bookmarked and clearly identified.
In order to address this, the evaluation software extracts the TITLE element, if any. If no such attribute is found, then a problem is generated. If it is found, then its content is analyzed to determine if the title length is longer than 64 characters. If so, a second problem is identified.
The software then presents a dialog box through which a user may define a new title, or change the content of the current title. The user inputs the desired string and an appropriate TITLE element is inserted in the page. The rule entitled Explicit Mail To relates to MAILTO: links which enable a user to send an email message by selecting a link. Accordingly, the link should contain an actual email address.
In order to address this problem, the evaluation software extracts all the labels of all the links having MAILTO: as protocol. It then checks if the email address specified in the URL of the link is present in the text of the <A> element containing it. If not, a problem is identified. To fix this kind of problems the user can open a dialog box in which the email address and the label of each MAILTO: link may be entered. By pressing a button the label can be changed to the address and then saved.
The rule entitled VLINK_Differs_From_LINK confirms that a color used for visited links is different than the one used for non-visited links. Many users rely on the fact that links that have been already followed change their color. In this way they can tell if part of a site has already been visited. In order to address this problem, the evaluation software reads the values of the attributes LINK and VLINK from the BODY element of the page, and compares them. If they are the same color (even if their name is different, like for 'BLACK' and '#000000', which are two ways to specify the same color) a problem is generated. To fix this problem a dialog can be opened which presents the two attributes and their values, as well as a color list from which the user can choose the desired colors. The conventional colors are highlighted, so that the user can select also those that are typical for websites.
The rule entitled Non-Standard LINK determines whether the page uses a conventional color for displaying visited and non visited links, respectively blue and purple. Many users rely on the fact that followed links are purple and new links are blue.
In order to address this problem, the evaluation software extracts the BODY element from the page and confirms whether it contains the attributes LINK and VLINK. If it does and the two colors are different than the conventional ones, then a problem is identified. A dialog box may then be opened which enables the user to set the default colors for these attributes. The rule entitled Empty Anchor Links is used to determine whether the page contains link specifications that do not have a clickable label. These links can occur where a web designer misplaces a tag for the A element. These <A href=.„> links cannot be followed by users because they have no label. They are generally harmless, but they add up to the page size, thus increasing page download time. In order to address this problem, the evaluation software extracts from the page all the A elements and checks what they have as labels. Those having an image or those having a nonempty string are skipped. The remaining elements are then identified as problematic. A dialog can be used to fix this problem. The dialog box displays the list of links. The user can then type the label for each of those links and save them within the source document 118.
The rule entitled Consistent Navigationals is a pairwise rule which determines whether the navigationals that are used in the model page are also used in the current page. Navigationals are defined as those links pointing to pages within the site and pages external to the site that are often used. In practice, important links are determined by scanning all the pages of the site, collecting the frequencies with which their destination URLs are used, and selecting those most frequently used. Links connecting two frames are not considered. When links that are often used in the site are missing from a page, it may mean that an inconsistent navigational structure is present in the page. If this is the case, then users will have a hard time exploring the site because relevant paths are not available and paths that users have already seen in previous pages no longer exist.
In order to address this problem, the evaluation software examines all the links leaving the model page and those leaving the current page. It then considers only those links that are navigationals. If the remaining sets differ substantially, a problem is identified. To fix the problem the user needs to decide which of the navigational links that are missing in the current page have to be added.
The rule entitled Reachable Home Page determines whether the page contains a link to the site home page. If it does not, it may pose a major navigational obstacle for users who, for example, bookmark and visit an internal web page, but wish to visit the home page for the site therefrom.
In order to address this problem, the evaluation software collects from the site model all the pages that are connected to the current one. If the home page does not appear among such a list, then a problem is identified. To fix this kind of problem, the software prompts the user to determine where and how to include a link to the home page. The rule provides a template for a textual link to the home page, but the actual fixing itself has to be performed by the user because of the determinations that have to be made. The rule entitled Logical Path Rule determines which is the shortest path from the home page to the current page that a visitor can follow by clicking on links. The software then checks whether from the current page there a link to each of the intermediate pages in such a path. Users may forget the route they followed to reach this page. Providing navigational aids in each page to help users to reach intermediate and initial pages of a site makes it much more navigable and usable. In addition, such navigational aids are clues for users to better understand the structure of a site. Relying only on the "BACK" button of the browser to support effective navigation of the site is not enough since some users may be visiting the page from a search engine or a bookmark, and the "BACK" button would not work.
In order to address this problem, the evaluation software scans all the links present in the page (e.g., those specified as textual ~ A ~ or embedded in images ~ AREA --) and checks whether they are included in the shortest path connecting the home page and the current page. If some are missing then a problem is identified. The problem can then be manually fixed by selecting the navigation structure that the software presents to the user and including it in the page. In this case, automatic fixing of the source document is not possible since adding links may require substantial reorganization of the page.
The rule entitled Isolated Pages is used to determine whether the current page can be reached from the home page. If isolated pages cannot be reached by users that start from the home page, then such users will not be able to view the page, even if it contains important material. The only way they would be able to view this page is via a search engine or through some bookmark. Isolated pages may occur due to spelling errors in links of other pages.
In order to address this problem, the evaluation software searches the site model for paths connecting the home page to the current one. If none is found (within preset time limits, which may be selected by users) then the page is declared to be unreachable and a problem is identified. To fix this problem the user needs to reorganize part of the site, which has to be done manually.
The rule entitled Self Referential Page is used to determine whether the page contains links that point to itself. Such links are generally useless.
In order to address this problem, the evaluation software collects all the links in the page that point to the very same page. It does not consider A elements whose destinations are named locations within the page (i.e. elements having this form <A HREF=thispage# namedlocation> ... </A> are skipped). The software then recommends that these links be deleted.
The rule entitled Broken Links is used to determine whether links contained in the page point to resources that are valid. The presence of invalid links on the page may confuse and irritate users. Such links may be the result of a typing error in the listed URL.
In order to address this problem, the evaluation software collects all the links pointing to resources external to the page (including URLs to other pages and images embedded in the page). If the links point to resources that are locally stored, then these resources are checked. If they are not accessible to the software, then they are assumed to be inaccessible, and a problem is identified. If links point to resources that are external to the site (i.e. they reside on a web server that is different from the one serving the current page), then these links are probed. If this fails because of network problems then nothing is done. If probing instead fails because the resources cannot be accessed, then a further problem is identified. Fixing this problem requires checking and modifying the URL of the broken links and possibly removing the link from the page if the resource does not exist anymore. A dialog box can be presented so that a user may perform both these activities.
The rule entitled Click Counting is used to determine whether the number of clicks that a user needs to perform to reach the current page starting from the home page is greater than a given threshold (which may be predefined by the user). An inordinate number of clicks may deter users from reaching and reading the page. Furthermore, it may confuse novice users because they would not be able to find what they need, leading them to conclude that the site does not contain such information. In order to address this problem, the evaluation software searches the path in the site model and counts the number of necessary clicks. If it exceed the threshold a problem is identified. To fix such a problem, site reorganization by the user is required.
The rule entitled Http Equiv Refresh Is Bad is used to determine whether the page contains HTML code that has the effect of automatically loading another page after some time has elapsed from the time when the current page was first selected by the browser. Such automatic loading of a new page has a number of drawbacks: (1) it may disorient users while they are reading the content of this page; (2) it may not allow users to examine and follow links present in this page; and (3) it reduces the effectiveness of the browser's BACK button (which is the second most used features by web users, after link following). In order to address this problem, the evaluation software determines whether the page has an element similar to the following one:
<META HTTP-EQUIV="refresh" CONTENT="5; url=new-page.html:"> If so and if the time specified in it is below a threshold (defined in the preferences), a problem is identified. Fixing this problem may require a reorganization of the page. However, the software may offer a dialog box that shows which is the destination page, and offers a button that can be used to automatically remove the META element. The rule entitled Http Equiv Refresh Forward is used to determine whether the page contains HTML code that has the effect of automatically loading another page. If so, then the page should also contain an explicit link to the same destination page. If the link does not exist, users wishing to access the page are forced to wait for the redirection to occur.
In order to address this problem, the evaluation software checks if page has an element similar to the following one:
<META HTTP-EQUIV="refresh" CONTENT="5; url=new-page.html:"> If so and if the page does not contain another link to 'new-page.html' then a problem is identified. Fixing this problem may require adding a new link to the destination page and in general this cannot be done automatically. The rule however offers a dialog showing the relevant information and enables the user to automatically remove the META element.
The rule entitled Http Equiv Refresh Backward determines whether the current page is the destination of some automatic redirection taking place in some other page. If so, then the current page should contain the link pointing back to such a page. This is because users with a slow connection will not be given the time to read that page and perhaps follow its links. The new page (i.e. the current one) will be automatically downloaded instead, and no links to the previous page are provided to the user.
In order to address this problem, the software first determines whether the current page can be reached by another page through a link specified as: <META HTTP-EQUIV="refresh" CONTENT="5; url=current_page.html:"> If so and if the current page does not contain another link to that page then a problem is identified. Fixing this problem may require the addition a new link to the destination page and in general this cannot be done automatically. The rule however offers a dialog showing the relevant information and enabling the user to automatically remove the META element from the other page.
The rule entitled Unvisible Image Maps is used to determine whether in the HTML code there are structures that are useless because no user would ever see them displayed by any sort of browser. Sometimes web designers use cut-and-paste methods to create new pages from old ones. It sometimes happens that the new page contains some 'left over' elements, like image maps (i.e. sets of links) without having an associated image. Such MAP elements do not appear in any image and are useless. Unused HTML code contributes to the page size and therefore its download time.
In order to address this problem, the evaluation software is programmed to scan the page and searches for MAP elements that are not associated to any images in the page. If such elements are found, a problem is identified to the user 2. The software or the user 2 may fix the problem by deleting the unneeded element.
The rule entitled Imagemap Links is used to determine whether links present in image maps are duplicated as textual links. Links specified by the image map should also appear as normal textual links, for three reasons: (1) many users browse the web with "image loading" disabled; such users would not be able to follow your links; (2) many browsers are not capable of rendering all the graphics (e.g. speaking machines, microbrowsers on hand-held devices) and therefore those links would not be accessible; and (3) imagemap links are not followed by all search engines, and therefore pages they point to would not be indexed.
In order to address this problem, the evaluation software is programmed to scan the page and extract all the visible AREA elements and their link lists. It then checks that these links appear also within A elements elsewhere in the page. It collects those that do not appear elsewhere and identifies them as a problem. Fixing this problem requires the addition of one or more A elements with these, links. Since user input is involved in the correction, the software presents all the relevant information to the user who then may modify the page using other tools. The rule entitled Show Links is a rule that displays some information about the page to the user. It does not identify usability problems but instead presents some statistics about the page. In particular it shows which are the links (of any sort) leaving the page. It displays their label (if textual) and their destination URL, and highlights local links.
The rule entitled Show Paths is a rule that displays some information about the page to the user. It does not identify usability problems, but instead presents some statistics about the page. In particular it shows a number (by default 50) of the shortest paths connecting the home page to the current page.
The rule entitled Show Site Navigationals is a site rule that displays some information about the site to the user. It does not identify usability problems but instead provides some statistics about the site. In particular it shows the list of local links that are important navigationals for the site. Links connecting two frames are not considered.
The rule entitled Max Clicks Number is a site rule that checks the average number of clicks on links that a user has to perform in order to reach each page. If the average number is above a predefined threshold (set in the preferences), then a problem is signaled. If the site contains many pages that, in order to be reached from the home page, require too many links to be followed, this may deter users from reaching and reading those pages. Furthermore, it may confuse novice users because they would not be able to find what they need, leading them to conclude that the site does not contain such information. In order to address these problems, the evaluation software computes the length of the shortest path from the home to every other site page. It then computes the average. It finally compares such a value with a predefined threshold.
Such a problem is not automatically fixable by the software, as it requires major reorganization of the site. However, the software may highlight those pages that are farthest from the home page.
The rule entitled List Of Images is a site rule that displays a list of the images used by pages of the site. It displays each image's URL and the pages that use that image. No usability problem is associated with this rule. However, it is helpful to use to track images used on the web site. The rule entitled Recycle Graphics is a site rule which confirms that image files on the site can be cached by a browser. Generally, different pages pointing to the same image should refer to the same file. Images that are included in web pages, before being rendered, are cached by the user browser in order to avoid downloading them when the same image has to be displayed the next time. However, if the image is referred to by different names (i.e. different URLs), each reference is cached separately and hence has to be downloaded separately, thus increasing the download time the user has to wait.
In order to address this problem, the evaluation software generates a table which maps the name and size of every image file on a site to a list of URLs referring to those files. The table is then used to check that image files are reused instead of copied. The software displays the images that are stored in duplicative files, including their size and the pages that use them. A dialog box can then be presented so that the user may replace or edit the URLs so that all requisite pages point to the same image file. The rule entitled Email Consistency is a site rule that checks whether each e-mail address included in a link element has the same label throughout the site. Inconsistent labels for e-mail addresses may confuse users of the site, requiring them to spend time and effort to investigate which of the apparently different links should be followed.
In order to address this problem, the evaluation software collects all the labels associated to <A HREF=MAILTO: ...> links in all the pages of the site and then compares their e-mail addresses. If two links refer to the same address but have different labels the problem is identified to the user 2. The software then presents all the addresses that have more than one label and a dialog box can be opened so that the user may choose a single label to be associated to each e-mail address, and further to replace previously used labels in some or all of the pages. The rule entitled PreFilteredPage is an HTML validation rule. It checks whether the values associated to elements attribute is correctly written. In particular it checks if the value requires double quotes. The rule also automatically fixes this problem. Certain browsers may fail to process such pages because their HTML does not satisfy the standard (HTML 3.2 or HTML 4.0). The software of the present invention scans the page and locates attribute values that are violating the above-mentioned requirement. It then may automatically add double quotes to prevent browser from accessing such commands and re-saves the source document.
The rule entitle Show Parser Errors is used to report the HTML errors that were found when parsing the page and those elements within the source document 118 that depend on the use of non-standard HTML. If the HTML contained in the page is not conformed to a current standard then some browsers may render the page in an unpredictable way.
In order to address this problem, the evaluation software compares each HTML structure used in the page to its standard definitions which is mentioned in the DOCTYPE specification in the page. If it is missing then in the preferences the user can select what standard to use. Each error is described in a concise way, as well as its location in the HTML code. To fix these kind of problems extensive re-coding may be required. The software, therefore, enables the user to open an HTML editor positioned in the error location.
The rule entitled Marquee Is Bad confirms that the current page does not contain the MARQUEE element, which creates a scrolling text area. <MARQUEE> .. </MARQUEE> can currently only be used with INTERNET EXPLORER. It is not standard HTML 4.0 nor can it be understood by other browsers. Animated features within a page such as a marquee can also negatively affect the way in which a user looks and reads the page content. Users will be continuously distracted away from the most important page content. In addition, users browsing this page with a browser that cannot understand the MARQUEE element will be presented with a page that does not work as intended.
In order to address this problem, the program scans the page searching for marquee elements. If any are found, then a problem is identified. The software then shows in detail the MARQUEE elements that it has found and recommends that they be disabled or deleted. The rule entitled SpacerlsBad is used to determine whether the current page contains a
SPACER element, which creates a blank space. Currently, <SPACER> .. </spacer> can only be used with NETSCAPE NAVIGATOR. It is not a standard HTML 4.0 command. Thus, a page incorporating this command may not be properly displayed by some browsers. In order to address this problem, the software scans the page searching for SPACER elements. If any are found, then a problem is identified. The software shows in detail the SPACER elements that it has found and proposes their replacements with alternative (but more standard) ways to achieve the same effect (eg. using '&nbsp;' or transparent gifs). A dialog box can be presented to a user so that they may replace the SPACER elements with selected alternatives.
The process and apparatus of the invention are further contemplated to include the following features. In a fine-tuning procedure contemplated for the present invention, a user may modify both the number of usability rules to be employed as well as the weight or severity of each rule after an initial analysis, according to process 31, is performed. The user may adjust both the number of rules used and the weight or severity of any number of the rules, and then reanalyze the web site, all in an iterative manner. This may be performed so as to tailor the analysis to web sites with particularized functionality.
The method and apparatus of the present invention is further contemplated to include a recorder feature and a downloading feature. The recorder feature can be used to receive dynamic web pages (e.g. those created by a web server on the fly) that are generated in response to user inputs. The recorder may copy the generated document and store it as a static file for analysis. The download feature can be used to download one or more source documents based on a submitted URL. The download feature may be customized to include user-definable features such as how links in the source documents are to be followed.
An authoring plug-in feature is also contemplated to be included in the present invention. This plug-in may be used to interface with web authoring software applications to receive source document information. This feature then enables a user to fix and modify proprietary web page templates used by such wed authoring software.
A server log analysis feature is further contemplated to be used with the present invention. The server log analysis plug-in may use server logs to decipher user activity on a web site and identify problematic attributes therefrom. The server log plug-in may be adapted to accommodate any format used by a web server to monitor such user activity. The server plug-in may contain a separate set of rules from those described previously which employ server log information to identify attribute problems.
Finally, a customized rules plug-in feature allows a user to generate and use customized rules which the user may develop to identify problems with web site attributes. The method and apparatus of the present invention may be adapted to present the user with a specialized user interface for entering new rule criteria. The computer language of the new rules does not have to match the language in which the predefined rules are stored. This feature may further allow a user to remotely store new usability rules and to instruct a server performing an analysis where to locate the stored rules on a network or the Internet.
Although the invention has been described in detail in the foregoing embodiments, it is to be understood that the descriptions have been provided for purposes of illustration only and that other variations both in form and detail can be made thereupon by those skilled in the art without departing from the spirit and scope of the invention, which is defined solely by the appended claims.

Claims

What is claimed is:
1. A method for evaluating a web site, comprising: receiving a source document corresponding to a web page having a plurality of attributes, the web page corresponding to a web site; comparing at least one of the attributes to a usability rule; and generating a usability rating of the web site Dάsed on said comparing.
2. The method of claim 1, wherein comparing at least one of the plurality of attributes further comprises: parsing at least one tag from the source document; and comparing the at least one tag to the usability rule.
3. The method of claim 2, further comprising: generating at least one replacement tag after the comparison, wherein the at least one replacement tag will increase the usability rating of the web site.
4. The method of claim 3, further comprising: replacing the at least one tag with the at least one replacement tag.
5. The method of claim 4, wherein replacing the at least one tag further comprises: receiving a user request to replace the at least one tag with the at least one replacement tag.
6. The method of claim 2, wherein the plurality of tags comprises at least one HTML tag.
7. The method of claim 2, wherein the plurality of tags comprises at least one XML tag.
8. The method of claim 2, wherein comparing the at least one attribute comprises: comparing the at least one attribute to a plurality of usability rules.
9. The method of claim 8, wherein at least one of the usability rules has a weight associated therewith.
10. The method of claim 9, wherein the usability rating is based on the weight of the at least one usability rule.
11. The method of claim 1 , further comprising inserting the usability rating into the source document.
12. The method of claim 1, wherein said comparing comprises: comparing the plurality of attributes to a plurality of usability rules.
13. The method of claim 12, further comprising: identifying a problem associated with at least one of the attributes.
14. The method of claim 12, further comprising: after said generating, receiving a second source document having a corrected attribute; and comparing the plurality of attributes, including the corrected attribute, to the plurality of usability rules; and generating a second usability rating based on the second source document.
15. The method of claim 14, wherein the corrected attribute corresponds to the problem.
16. The method of claim 1, wherein said receiving comprises: receiving a uniform resource locator corresponding to the web site; and downloading the source document from the web site.
17. The method of claim 1, wherein said comparing comprises: generating a site model corresponding to the web site.
18. The method of claim 1 wherein the usability rule comprises at least one of a single rule, a pairwise rule and a site rule.
19. The method of claim 1, wherein the at least one attribute comprises a body color portability of one of (i) a foreground and (ii) a background color listed in a body tag of the source document.
20. The method of claim 19, wherein the usability rule comprises a requirement that each color listed in the body tag consists of at least one of a predetermined list of standard colors.
21. The method of claim 1 , wherein the at least one attribute comprises a font color portability of each font color listed in a font color tag of the source document.
22. The method of claim 21, wherein the usability rule comprises a requirement that each font color consists of at least one of a predetermined list of standard colors.
23. The method of claim 1 , wherein the at least one attribute comprises a known body color in a body tag of the source document.
24. The method of claim 23, wherein the usability rule comprises a requirement that each body color listed in the body consists of at least one of (i) an RGB string and (ii) a VGA color.
25. The method of claim 1 , wherein the at least one attribute comprises a known font color in a font color tag of the source document.
26. The method of claim 25, wherein the usability rule comprises a requirement that each font color in the font color tag consists of at least one of: (i) an RGB string and (ii) a VGA color.
27. The method of claim 1, wherein the at least one attribute comprises a number of blink elements in the source document.
28. The method of claim 27, wherein the usability rule comprises a requirement that the number of blink elements is zero.
29. The method of claim 1 , wherein the at least one attribute comprises a number of headings in the source document.
30. The method of claim 29, wherein the usability rule comprises a requirement that the number of headings corresponds to an amount of text in the source document.
31. The method of claim 1 , wherein the at least one attribute comprises a body consistency of the source document.
32. The method of claim 31, wherein the usability rule comprises a requirement that the source document include a body element if a second source document corresponding to the web site includes a second body element.
33. The method of claim 1, wherein the at least one attribute comprises a body color consistency of the source document.
34. The method of claim 33, wherein the usability rule comprises a requirement that a set of body colors listed in the source document for each of a background color, a foreground text, a selected link, an unselected link and an active link matches a second set of body colors corresponds to a second set of body colors listed in a second source document corresponding to the web site.
35. The method of claim 1, wherein the at least one attribute comprises a link color of the source document.
36. The method of claim 35, wherein the usability rule comprises a requirement that the link color of the source document corresponds to a second link color of a second source document corresponding to the web site.
37. The method of claim 1, wherein the at least one attribute comprises a frameset element of the source document.
38. The method of claim 37, wherein the usability rule comprises a requirement that the source document includes a noframes element if the source document includes a frameset element .
39. The method of claim 1, wherein the at least one attribute comprises a noframes element of the source document.
40. The method of claim 39, wherein the usability rule comprises a requirement that the source document includes a textual description corresponding to the noframes element if the source document includes the noframes element.
41. The method of claim 1, wherein the at least one attribute comprises noframes links of the source document.
42. The method of claim 41, wherein the usability rule comprises a requirement that the source document includes a set of links in a noframes element which corresponds to a second set of links in a frameset element of the source document.
43. The method of claim 1, wherein the at least one attribute comprises a number of iframe elements of the source document.
44. The method of claim 43, wherein the usability rule comprises a requirement that the number of iframe elements is zero.
45. The method of claim 1 , wherein the at least one attribute comprises a frame border of the source document.
46. The method of claim 45,'wherein the usability rule comprises a requirement that each frame border corresponds to a predetermined format.
47. The method of claim 1, wherein the at least one attribute comprises a frame margin of the source document.
48. The method of claim 47, wherein tϊie usability rule comprises a requirement that each frame margin corresponds to a predetermined format.
49. The method of claim 1 , wherein the at least one attribute comprises a target tag of a link within a frame of the source document.
50. The method of claim 49, wherein the usability rule comprises a requirement that the target tag includes a designation that a web page corresponding to the link is rendered outside the frame and within a current browser window.
51. The method of claim 1 , wherein the at least one attribute comprises a number of noresize elements within each frame of the source document.
52. The method of claim 51 , wherein the usability rule comprises a requirement that the number of noresize elements is zero.
53. The method of claim 1, wherein the at least one attribute comprises a number of frames of the source document.
54. The method of claim 53, wherein the usability rule comprises a requirement that the number of frames is zero.
55 The method of claim 1, wherein the at least one attribute comprises a page size of the source document.
56. The method of claim 56, wherein the usability rule comprises a requirement that the page size is less than a predetermined size.
57. The method of claim 1 , wherein the at least one attribute comprises an image description corresponding to an image presented by the source document.
58. The method of claim 57, wherein the usability rule comprises a requirement that the source document includes an alternate text description for each image presented by the source document.
59. The method of claim 1 , wherein the at least one attribute comprises a size of an alternate text description of an image presented by the source document.
60. The method of claim 59, wherein the usability rule comprises a requirement that the size corresponds to a predetermined value.
61. The method of claim 1 , wherein the at least one attribute comprises a content of an alternate text description corresponding to an image presented by the source document.
62. The method of claim 61, wherein the usability rule comprises a requirement that the content includes a description other than a suggestion to implement image browsing.
63. The method of claim 1, wherein the at least one attribute comprises an embedded size description corresponding to an image presented by the source document.
64. The method of claim 63, wherein the usability rule comprises a requirement that the embedded size description includes a height and a width of the image.
65. The method of claim 1, wherein the at least one attribute comprises a GIF image size description of the source document.
66. The method of claim 65, wherein the usability rule comprises a requirement that the GIF image size description corresponds to a file size of a GIF image presented by the source document.
67. The method of claim 1, wherein the at least one attribute comprises a number of links of the source document with embedded image information of the source document.
68. The method of claim 67, wherein the usability rule comprises a requirement that the number of links with embedded image information is zero.
69. The method of claim 1, wherein the at least one attribute comprises a background image of the source document.
70. The method of claim 69, wherein the usability rule comprises a requirement that the background image correspond to a second background image of a second source document.
71. The method of claim 1, wherein the at least one attribute comprises a source document description of the source document.
72. The method of claim 71, wherein the usability rule comprises a requirement that the source document description corresponds to a predetermined size.
73. The method of claim 1, wherein the at least one attribute comprises a keyword description of the source document.
74. The method of claim 73, wherein the usability rule comprises a requirement that a size of the keyword description corresponds to a predetermined size and that the keyword description does not include an HTML tag.
75. The method of claim 1, wherein the at least one attribute comprises a title of the source document.
76. The method of claim 75, wherein the usability rule comprises a requirement that the title corresponds to a predetermined size.
77. The method of claim 1, wherein the at least one attribute comprises a mailto link of the source document.
78. The method of claim 77, wherein the usability rule comprises a requirement that the mailto link includes an e-mail address.
79. The method of claim 1, wherein the at least one attribute comprises a visited link color designation of the source document.
80. The method of claim 79, wherein the usability rule comprises a requirement that the visited link color designation differs from an unvisited link color designation.
81. The method of claim 1 , wherein the at least one attribute comprises a first color designation for visited links and a second color designation for non-visited links of the source document.
82. The method of claim 81 , wherein the usability rule comprises a requirement that the first and second color designations correspond to predetermined colors.
83. The method of claim 1, wherein the at least one attribute comprises a label of a link presented by the source document.
84. The method of claim 83, wherein the usability rule comprises a requirement that the label is enabled for activation by a mouse.
85. The method of claim 1, wherein the at least one attribute comprises a navigational consistency of the source document.
86. The method of claim 85, wherein the usability rule comprises a requirement that a link presented by a plurality of separate source documents which correspond to the web site is presented by the source document.
87. The method of claim 1 , wherein the at least one attribute comprises a home page navigational link for the source document.
88. The method of claim 87, wherein the usability rule comprises a requirement that the source document present a link to a home page corresponding to the web site.
89. The method of claim 1, wherein the at least one attribute comprises a logical path corresponding to the source document.
90. The method of claim 89, wherein the usability rule comprises a requirement that at least one link corresponding to at least one intermediate web page is presented by a source document.
91. The method of claim 1, wherein the at least one attribute comprises an isolation condition of the web page corresponding to the source document.
92. The method of claim 91, wherein the usability rule comprises a requirement that a home page of the web site contain a link to the web page.
93. The method of claim 1, wherein the at least one attribute comprises a number of self-referential links presented by the source document.
94. The method of claim 93, wherein the usability rule comprises a requirement that the number of self-referential links is zero.
95. The method of claim 1, wherein the at least one attribute comprises a number of broken links presented by the source document.
96. The method of claim 95, wherein the usability rule comprises a requirement that the number of broken links is zero.
97. The method of claifr. i , wherein the at least one attribute comprises an accessibility of the web page corresponding to the source document from a home page of the web site.
98. The method of claim 69, wherein the usability rule comprises a requirement that a number of clicks to access the web page from the home does not exceed a predetermined number.
99. The method of claim 1 , wherein the at least one attribute comprises a refresh command of the source document.
100. The method of claim 99, wherein the usability rule comprises a requirement that the refresh command does not initiate a loading of a second web page.
101. The method of claim 1 , wherein the at least one attribute comprises a refresh forward command of the source document.
102. The method of claim 101, wherein the usability rule comprises a requirement that a link to a second web page is presented which second web page is designated to be loaded according to the refresh forward command.
103. The method of claim 1, wherein the at least one attribute comprises a refresh backward command of the source document.
104. The method of claim 101, wherein the usability rule comprises a requirement that a link to a second web page is presented which second web page is designated to be loaded according to the refresh backward command.
105. The method of claim 1, wherein the at least one attribute comprises a number of image map elements of the source document.
106. The method of claim 105, wherein the usability rule comprises a requirement that the number of image map elements is zero.
107. The method of claim 1, wherein the at least one attribute comprises an image map link present in the source document.
108. The method of claim 107, wherein the usability rule comprises a requirement that the image map link corresponds to a textual link presented by the source document.
109. The method of claim 1, wherein the at least one attribute comprises an average number of clicks needed to reach the web page from a home page of a web site.
1 10. The method of claim 109, wherein the usability rule comprises a requirement that the number of clicks is no greater than a predetermined value.
1 1 1. The method of claim 1 , wherein the at least one attribute comprises an image file corresponding to an image presented by the source document.
112. The method of claim 11 1, wherein the usability rule comprises a requirement that the image file corresponds to a second instance of the image file in a second source document of the web site which presents the image.
113. The method of claim 1, wherein the at least one attribute comprises an element value of the source document.
114. The method of claim 1 13, wherein the usability rule comprises a requirement that the element value corresponds to a predetermined format.
1 15. The method of claim 1 , wherein the at least one attribute comprises a number of non-standard language elements of the source document.
1 16. The method of claim 1 15, wherein the usability rule comprises a requirement that the number of non-standard language elements is zero.
1 17. The method of claim 1 , wherein the at least one attribute comprises a number of marquee elements for presentation by the source document.
118. The method of claim 117, wherein the usability rule comprises a requirement that the number of marquee elements is zero.
1 19. The method of claim 1, wherein the at least one attribute comprises a number of spacer elements of the source document.
120. The method of claim 119, wherein the usability rule comprises a requirement that the number of spacer elements is zero.
121. A computer-readable medium encoded with processing instructions for implementing a method for evaluating a web site, the method comprising: receiving a source document corresponding to a web page having a plurality of attributes, the web page corresponding to a web site; comparing at least one of the attributes to a usability rule; and generating a usability rating of the web site based on said comparing.
122. An apparatus for evaluating a web site, comprising: means for receiving a source document corresponding to a web page having a plurality of attributes, the web page corresponding to a web site; means for comparing at least one of the attributes to a usability rule; and means for generating a usability rating of the web site based on said comparing.
123. An apparatus for evaluating a web site, comprising: a processor; and a memory in operative connection with the processor, the memory for storing processing instructions for directing the processor to: receive a source document corresponding to a web page having a plurality of attributes, the web page corresponding to a web site; compare at least one of the attributes to a usability rule; and generate a usability rating of the web site based on said comparing.
124. A method for evaluating a web site, comprising: identifying a source document corresponding to a web site having a plurality of attributes; requesting a comparison of the source document to a usability rule; and receiving a usability rating of the web site in response to the request.
125. A method for increasing a usability of a web site, comprising: receiving a source document corresponding to a web site having a plurality of attributes; receiving a request for a comparison of the source document to a plurality of usability rules; and sending an identification of a problem with at least one of the attributes in response to the request.
126. The method of claim 125, wherein the comparison further comprises: parsing at least one tag from the source document; and comparing the at least one tag to the usability rule.
127. The method of claim 126, further comprising: generating at least one replacement tag after the comparison, wherein the at least one replacement tag will increase the usability rating of the web site.
128. The method of claim 127, further comprising: replacing the at least one tag with the at least one replacement tag.
129. The method of claim 128, wherein replacing the at least one tag further comprises: receiving a user request to replace the at least one tag with the at least one replacement tag.
130. The method of claim 126, wherein the plurality of tags comprises at least one HTML tag.
131. The method of claim 126, wherein the plurality of tags comprises at least one XML tag.
132. The method of claim 126, wherein comparing the at least one attribute comprises: comparing the at least one attribute to a plurality of usability rules.
133. The method of claim 132, wherein at least one of the usability rules has a weight associated therewith.
134. The method of claim 133, wherein the usability rating is based on the weight of the at least one usability rule.
135. The method of claim 125, further comprising: generating a usability rating of the web site based on the problem; and inserting the usability rating into the source document.
136. The method of claim 125, wherein said comparison comprises: comparing the plurality of attributes to a plurality of usability rules.
137. The method of claim 136, further comprising: identifying a problem associated with at least one of the attributes.
138. The method of claim 137, further comprising: after said generating, receiving a second source document having a corrected attribute; and comparing the plurality of attributes, including the corrected attribute, to the plurality of usability rules; and generating a second usability rating based on the second source document.
139. The method of claim 138, wherein the corrected attribute corresponds to the problem.
140. The method of claim 125, wherein said receiving comprises: receiving a uniform resource locator corresponding to the web site; and downloading the source document from the web site.
141. The method of claim 125, wherein said comparison comprises: generating a site model corresponding to the web site.
142. The method of claim 125 wherein the usability rule comprises at least one of a single rule, a pairwise rule and a site rule.
143. The method of claim 125, wherein the at least one attribute comprises a body color portability of one of (i) a foreground and (ii) a background color listed in a body tag of the source document.
144. The method of claim 143, wherein the usability rule comprises a requirement that each color listed in the body tag consists of at least one of a predetermined list of standard colors.
145. The method of claim 125, wherein the at least one attribute comprises a font color portability of each font color listed in a font color tag of the source document.
146. The method of claim 145, wherein the usability rule comprises a requirement that each font color consists of at least one of a predetermined list of standard colors.
147. The method of claim 125, wherein the at least one attribute comprises a known body color in a body tag of the source document.
148. The method of claim 147, wherein the usability rule comprises a requirement that each body color listed in the body consists of at least one of (i) an RGB string and (ii) a VGA color.
149. The method of claim 125, wherein the at least one attribute comprises a known font color in a font color tag of the source document.
150. The method of claim 149, wherein the usability rule comprises a requirement that each font color in the font color tag consists of at least one of: (i) an RGB string and (ii) a VGA color.
151. The method of claim 125, wherein the at least one attribute comprises a number of blink elements in the source document.
152. The method of claim 151, wherein the usability rule comprises a requirement that the number of blink elements is zero.
153. The method of claim 125, wherein the at least one attribute comprises a number of headings in the source document.
154. The method of claim 153, wherein the usability rule comprises a requirement that the number of headings corresponds to an amount of text in the source document.
155. The method of claim 125, wherein the at least one attribute comprises a body consistency of the source document.
156. The method of claim 155, wherein the usability rule comprises a requirement that the source document include a body element if a second source document corresponding to the web site includes a second body element.
157. The method of claim 125, wherein the at least one attribute comprises a body color consistency of the source document.
158. The method of claim 157, wherein the usability rule comprises a requirement that a set of body colors listed in the source document for each of a background color, a foreground text, a selected link, an unselected link and an active link matches a second set of body colors corresponds to a second set of body colors listed in a second source document corresponding to the web site.
159. The method of claim 125, wherein the at least one attribute comprises a link color of the source document.
160. The method of claim 159, wherein the usability rule comprises a requirement that the link color of the source document corresponds to a second link color of a second source document corresponding to the web site.
161. The method of claim 125, wherein the at least one attribute comprises a frameset element of the source document.
162. The method of claim 161, wherein the usability rule comprises a requirement that the source document includes a noframes element if the source document includes a frameset element
163. The method of claim 125, wherein the at least one attribute comprises a noframes element of the source document.
164. The method of claim 163, wherein the usability rule comprises a requirement that the source document includes a textual description corresponding to the noframes element if the source document includes the noframes element.
165. The method of claim 125, wherein the at least one attribute comprises noframes links of the source document.
166. The method of claim 165, wherein the usability rule comprises a requirement that the source document includes a set of links in a noframes element which corresponds to a second set of links in a frameset element of the source document.
167. The method of claim 125, wherein the at least one attribute comprises a number of iframe elements of the source document.
168. The method of claim 167, wherein the usability rule comprises a requirement that the number of iframe elements is zero.
169. The method of claim 125, wherein the at least one attribute comprises a frame border of the source document.
170. The method of claim 169, wherein the usability rule comprises a requirement that each frame border corresponds to a predetermined format.
171. The method of claim 125, wherein the at least one attribute comprises a frame margin of the source document.
172. The method of claim 171, wherein the usability rule comprises a requirement that each frame margin corresponds to a predetermined format.
173. The method of claim 125, wherein the at least one attribute comprises a target tag of a link within a frame of the source document.
174. The method of claim 173, wherein the usability rule comprises a requirement that the target tag includes a designation that a web page corresponding to the link is rendered outside the frame and within a current browser window.
175. The method of claim 125, wherein the at least one attribute comprises a number of noresize elements within each frame of the source document.
176. The method of claim 175, wherein the usability rule comprises a requirement that the number of noresize elements is zero.
177. The method of claim 125, wherein the at least one attribute comprises a number of frames of the source document.
178. The method of claim 177, wherein the usability rule comprises a requirement that the number of frames is zero.
179. The method of claim 125, wherein the at least one attribute comprises a page size of the source document.
180. The method of claim 179, wherein the usability rule comprises a requirement that the page size is less than a predetermined size.
181. The method of claim 125, wherein the at least one attribute comprises an image description corresponding to an image presented by the source document.
182. The method of claim 181, wherein the usability rule comprises a requirement that the source document includes an alternate text description for each image presented by the source document.
183. The method of claim 125, wherein the at least one attribute comprises a size of an alternate text description of an image presented by the source document.
184. The method of claim 183, wherein the usability rule comprises a requirement that the size corresponds to a predetermined value.
185. The method of claim 125, wherein the at least one attribute comprises a content of an alternate text description corresponding to an image presented by the source document.
186. The method of claim 185, wherein the usability rule comprises a requirement that the content includes a description other than a suggestion to implement image browsing.
187. The method of claim 186, wherein the at least one attribute comprises an embedded size description corresponding to an image presented by the source document.
188. The method of claim 187, wherein the usability rule comprises a requirement that the embedded size description includes a height and a width of the image.
189. The method of claim 125, wherein the at least one attribute comprises a GIF image size description of the source document.
190. The method of claim 189, wherein the usability rule comprises a requirement that the GIF image size description corresponds to a file size of a GIF image presented by the source document.
191. The method of claim 125, wherein the at least one attribute comprises a number of links of the source document with embedded image information of the source document.
192. The method of claim 191, wherein the usability rule comprises a requirement that the number of links with embedded image information is zero.
193. The method of claim 125, wherein the at least one attribute comprises a background image of the source document.
194. The method of claim 193, wherein the usability rule comprises a requirement that the background image correspond to a second background image of a second source document.
195. The method of claim 125, wherein the at least one attribute comprises a source document description of the source document.
196. The method of claim 195, wherein the usability rule comprises a requirement that the source document description corresponds to a predetermined size.
197. The method of claim 125, wherein the at least one attribute comprises a keyword description of the source document.
198. The method of claim 197, wherein the usability rule comprises a requirement that a size of the keyword description corresponds to a predetermined size and that the keyword description does not include an HTML tag.
199. The method of claim 125, wherein the at least one attribute comprises a title of the source document.
200. The method of claim 199, wherein the usability rule comprises a requirement that the title corresponds to a predetermined size.
201. The method of claim 125, wherein the at least one attribute comprises a mailto link of the source document.
202. The method of claim 201 , wherein the usability rule comprises a requirement that the mailto link includes an e-mail address.
203. The method of claim 125, wherein the at least one attribute comprises a visited link color designation of the source document.
204. The method of claim 203, wherein the usability rule comprises a requirement that the visited link color designation differs from an unvisited link color designation.
205. The method of claim 125, wherein the at least one attribute comprises a first color designation for visited links and a second color designation for non-visited links of the source document.
206. The method of claim 205, wherein the usability rule comprises a requirement that the first and second color designations correspond to predetermined colors.
207. The method of claim 125, wherein the at least one attribute comprises a label of a link presented by the source document.
208. The method of claim 207, wherein the usability rule comprises a requirement that the label is enabled for activation by a mouse.
209. The method of claim 125, wherein the at least one attribute comprises a navigational consistency of the source document.
210. The method of claim 209, wherein the usability rule comprises a requirement that a link presented by a plurality of separate source documents which correspond to the web site is presented by the source document.
211. The method of claim 125, wherein the at least one attribute comprises a home page navigational link for the source document.
212. The method of claim 211, wherein the usability rule comprises a requirement that the source document present a link to a home page corresponding to the web site.
213. The method of claim 125, wherein the at least one attribute comprises a logical path corresponding to the source document.
214. The method of claim 213, wherein the usability rule comprises a requirement that at least one link corresponding to at least one intermediate web page is presented by a source document.
215. The method of claim 125, wherein the at least one attribute comprises an isolation condition of the web page corresponding to the source document.
216. The method of claim 215, wherein the usability rule comprises a requirement that a home page of the web site contain a link to the web page.
217. The method of claim 125, wherein the at least one attribute comprises a number of self-referential links presented by the source document.
218. The method of claim 217, wherein the usability rule comprises a requirement that the number of self-referential links is zero.
219. The method of claim 125, wherein the at least one attribute comprises a number of broken links presented by the source document.
220. The method of claim 219, wherein the usability rule comprises a requirement that the number of broken links is zero.
221. The method of claim 125, wherein the at least one attribute comprises an accessibility of the web page corresponding to the source document from a home page of the web site.
222. The method of claim 221, wherein the usability rule comprises a requirement that a number of clicks to access the web page from the home does not exceed a predetermined number.
223. The method of claim 125, wherein the at least one attribute comprises a refresh command of the source document.
224. The method of claim 223, wherein the usability rule comprises a requirement that the refresh command does not initiate a loading of a second web page.
225. The method of claim 125, wherein the at least one attribute comprises a refresh forward command of the source document.
226. The method of claim 225, wherein the usability rule comprises a requirement that a link to a second web page is presented which second web page is designated to be loaded according to the refresh forward command.
227. The method of claim 125, wherein the at least one attribute comprises a refresh backward command of the source document.
228. The method of claim 227, wherein the usability rule comprises a requirement that a link to a second web page is presented which second web page is designated to be loaded according to the refresh backward command.
229. The method of claim 125, wherein the at least one attribute comprises a number of image map elements of the source document.
230. The method of claim 229, wherein the usability rule comprises a requirement that the number of image map elements is zero.
231. The method of claim 125, wherein the at least one attribute comprises an image map link present in the source document.
232. The method of claim 231 , wherein the usability rule comprises a requirement that the image map link corresponds to a textual link presented by the source document.
233. The method of claim 125, wherein the at least one attribute comprises an average number of clicks needed to reach the web page from a home page of a web site.
234. The method of claim 233, wherein the usability rule comprises a requirement that the number of clicks is no greater than a predetermined value.
235. The method of claim 125, wherein the at least one attribute comprises an image file corresponding to an image presented by the source document.
236. The method of claim 235, wherein the usability rule comprises a requirement that the image file corresponds to a second instance of the image file in a second source document of the web site which presents the image.
237. The method of claim 125, wherein the at least one attribute comprises an element value of the source document.
238. The method of claim 237, wherein the usability rule comprises a requirement that the element value corresponds to a predetermined format.
239. The method of claim 125, wherein the at least one attribute comprises a number of non-standard language elements of the source document.
240. The method of claim 239, wherein the usability rule comprises a requirement that the number of non-standard language elements is zero.
241. The method of claim 125, wherein the at least one attribute comprises a number of marquee elements for presentation by the source document.
242. The method of claim 241 , wherein the usability rule comprises a requirement that the number of marquee elements is zero.
243. The method of claim 125, wherein the at least one attribute comprises a number of spacer elements of the source document.
244. The method of claim 243, wherein the usability rule comprises a requirement that the number of spacer elements is zero.
245. An apparatus for increasing a usability of a web site, comprising: means for receiving a source document corresponding to a web site having a plurality of attributes; means for receiving a request for a comparison of the source document to a plurality of usability rules; and means for sending an identification of a problem with at least one of the attributes in response to the request.
246. An apparatus for increasing a usability of a web site, comprising: a processor; and a memory in operative connection with the processor, the memory for storing processing instructions for directing the processor to: receive a source document corresponding to a web site having a plurality of attributes; receive a request for a comparison of the source document to a plurality of usability rules; and send an identification of a problem with at least one of the attributes in response to the request.
247. A computer-readable medium encoded with processing instructions for implementing a method for increasing a usability of a web site, the method comprising: receiving a source document corresponding to a web site having a plurality of attributes; receiving a request for a comparison of the source document to a plurality of usability rules; and sending an identification of a problem with at least one of the attributes in response to the request.
248. A method for increasing a usability of a web site, comprising: identifying a web site having a plurality of attributes; sending a request for a comparison of the web site to a plurality of usability rules; and receiving m identification of a problem with at least one of the attributes in response to the request.
249. The method of claim 248, wherein said identifying comprises: transmitting a uniform resource locator corresponding to the web site.
250. The method of claim 248, wherein said identifying comprises: loading at least one source document corresponding to the web site.
251. The method of claim 248, further comprising: receiving a usability rating of the web site based on said comparing.
252. The method of claim 248, further comprising: receiving a usability rating for the web site in response to the request.
253. The method of claim 248, further comprising: receiving a suggested correction for the attribute.
254. The method of claim 253, further comprising: entering the suggested correction into the source document.
255. The method of claim 248, further comprising: correcting the attribute based on said receiving.
PCT/US2001/010862 2000-04-03 2001-04-03 System and method for evaluating an optimizing web site attributes WO2001075678A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2001251281A AU2001251281A1 (en) 2000-04-03 2001-04-03 System and method for evaluating an optimizing web site attributes

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US54248200A 2000-04-03 2000-04-03
US09/542,482 2000-04-03

Publications (1)

Publication Number Publication Date
WO2001075678A1 true WO2001075678A1 (en) 2001-10-11

Family

ID=24164001

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2001/010862 WO2001075678A1 (en) 2000-04-03 2001-04-03 System and method for evaluating an optimizing web site attributes

Country Status (2)

Country Link
AU (1) AU2001251281A1 (en)
WO (1) WO2001075678A1 (en)

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2386218A (en) * 2002-03-06 2003-09-10 Fujitsu Ltd Apparatus and method for evaluating web pages
US7020845B1 (en) * 1999-11-15 2006-03-28 Gottfurcht Elliot A Navigating internet content on a television using a simplified interface and a remote control
CN1297936C (en) * 2003-08-15 2007-01-31 国际商业机器公司 Method and system for comparing files of two computers
US7680645B2 (en) 2007-06-15 2010-03-16 Microsoft Corporation Software feature modeling and recognition
US9292615B2 (en) 2011-06-27 2016-03-22 Sitecore A/S Method and a system for analysing impact of changes to content of a website
US9372933B2 (en) 2013-11-06 2016-06-21 International Business Machines Corporation Personalized quantitative measure of accessibility
CN111193795A (en) * 2019-12-30 2020-05-22 腾讯科技(深圳)有限公司 Information pushing method and device, electronic equipment and computer readable storage medium
US10809877B1 (en) 2016-03-18 2020-10-20 Audioeye, Inc. Modular systems and methods for selectively enabling cloud-based assistive technologies
US10867120B1 (en) 2016-03-18 2020-12-15 Audioeye, Inc. Modular systems and methods for selectively enabling cloud-based assistive technologies
US10896286B2 (en) 2016-03-18 2021-01-19 Audioeye, Inc. Modular systems and methods for selectively enabling cloud-based assistive technologies
CN112988531A (en) * 2019-12-18 2021-06-18 中国移动通信集团浙江有限公司 Usability evaluation method and device for application software and computing equipment
CN113110979A (en) * 2020-01-13 2021-07-13 中国移动通信集团浙江有限公司 Application program optimization evaluation method and device, computing equipment and storage medium
US11727195B2 (en) 2016-03-18 2023-08-15 Audioeye, Inc. Modular systems and methods for selectively enabling cloud-based assistive technologies

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5835905A (en) * 1997-04-09 1998-11-10 Xerox Corporation System for predicting documents relevant to focus documents by spreading activation through network representations of a linked collection of documents
US5870559A (en) * 1996-10-15 1999-02-09 Mercury Interactive Software system and associated methods for facilitating the analysis and management of web sites
US5968125A (en) * 1997-01-21 1999-10-19 Net. Roi Process for optimizing the effectiveness of a hypertext element
US6021439A (en) * 1997-11-14 2000-02-01 International Business Machines Corporation Internet quality-of-service method and system
US6052730A (en) * 1997-01-10 2000-04-18 The Board Of Trustees Of The Leland Stanford Junior University Method for monitoring and/or modifying web browsing sessions
US6154736A (en) * 1997-07-30 2000-11-28 Microsoft Corporation Belief networks with decision graphs
US6157618A (en) * 1999-01-26 2000-12-05 Microsoft Corporation Distributed internet user experience monitoring system

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5870559A (en) * 1996-10-15 1999-02-09 Mercury Interactive Software system and associated methods for facilitating the analysis and management of web sites
US6052730A (en) * 1997-01-10 2000-04-18 The Board Of Trustees Of The Leland Stanford Junior University Method for monitoring and/or modifying web browsing sessions
US5968125A (en) * 1997-01-21 1999-10-19 Net. Roi Process for optimizing the effectiveness of a hypertext element
US5835905A (en) * 1997-04-09 1998-11-10 Xerox Corporation System for predicting documents relevant to focus documents by spreading activation through network representations of a linked collection of documents
US6154736A (en) * 1997-07-30 2000-11-28 Microsoft Corporation Belief networks with decision graphs
US6021439A (en) * 1997-11-14 2000-02-01 International Business Machines Corporation Internet quality-of-service method and system
US6157618A (en) * 1999-01-26 2000-12-05 Microsoft Corporation Distributed internet user experience monitoring system

Cited By (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7020845B1 (en) * 1999-11-15 2006-03-28 Gottfurcht Elliot A Navigating internet content on a television using a simplified interface and a remote control
GB2386218A (en) * 2002-03-06 2003-09-10 Fujitsu Ltd Apparatus and method for evaluating web pages
US7395498B2 (en) 2002-03-06 2008-07-01 Fujitsu Limited Apparatus and method for evaluating web pages
CN1297936C (en) * 2003-08-15 2007-01-31 国际商业机器公司 Method and system for comparing files of two computers
US7680645B2 (en) 2007-06-15 2010-03-16 Microsoft Corporation Software feature modeling and recognition
US9292615B2 (en) 2011-06-27 2016-03-22 Sitecore A/S Method and a system for analysing impact of changes to content of a website
US9372933B2 (en) 2013-11-06 2016-06-21 International Business Machines Corporation Personalized quantitative measure of accessibility
US11061532B2 (en) 2016-03-18 2021-07-13 Audioeye, Inc. Modular systems and methods for selectively enabling cloud-based assistive technologies
US11151304B2 (en) 2016-03-18 2021-10-19 Audioeye, Inc. Modular systems and methods for selectively enabling cloud-based assistive technologies
US10845946B1 (en) 2016-03-18 2020-11-24 Audioeye, Inc. Modular systems and methods for selectively enabling cloud-based assistive technologies
US10845947B1 (en) 2016-03-18 2020-11-24 Audioeye, Inc. Modular systems and methods for selectively enabling cloud-based assistive technologies
US10860173B1 (en) 2016-03-18 2020-12-08 Audioeye, Inc. Modular systems and methods for selectively enabling cloud-based assistive technologies
US10867120B1 (en) 2016-03-18 2020-12-15 Audioeye, Inc. Modular systems and methods for selectively enabling cloud-based assistive technologies
US10866691B1 (en) 2016-03-18 2020-12-15 Audioeye, Inc. Modular systems and methods for selectively enabling cloud-based assistive technologies
US10896286B2 (en) 2016-03-18 2021-01-19 Audioeye, Inc. Modular systems and methods for selectively enabling cloud-based assistive technologies
US10928978B2 (en) 2016-03-18 2021-02-23 Audioeye, Inc. Modular systems and methods for selectively enabling cloud-based assistive technologies
US10997361B1 (en) 2016-03-18 2021-05-04 Audioeye, Inc. Modular systems and methods for selectively enabling cloud-based assistive technologies
US11029815B1 (en) 2016-03-18 2021-06-08 Audioeye, Inc. Modular systems and methods for selectively enabling cloud-based assistive technologies
US11836441B2 (en) 2016-03-18 2023-12-05 Audioeye, Inc. Modular systems and methods for selectively enabling cloud-based assistive technologies
US11727195B2 (en) 2016-03-18 2023-08-15 Audioeye, Inc. Modular systems and methods for selectively enabling cloud-based assistive technologies
US11455458B2 (en) 2016-03-18 2022-09-27 Audioeye, Inc. Modular systems and methods for selectively enabling cloud-based assistive technologies
US11157682B2 (en) 2016-03-18 2021-10-26 Audioeye, Inc. Modular systems and methods for selectively enabling cloud-based assistive technologies
US11080469B1 (en) 2016-03-18 2021-08-03 Audioeye, Inc. Modular systems and methods for selectively enabling cloud-based assistive technologies
US10809877B1 (en) 2016-03-18 2020-10-20 Audioeye, Inc. Modular systems and methods for selectively enabling cloud-based assistive technologies
CN112988531B (en) * 2019-12-18 2022-08-12 中国移动通信集团浙江有限公司 Usability evaluation method and device for application software and computing equipment
CN112988531A (en) * 2019-12-18 2021-06-18 中国移动通信集团浙江有限公司 Usability evaluation method and device for application software and computing equipment
CN111193795A (en) * 2019-12-30 2020-05-22 腾讯科技(深圳)有限公司 Information pushing method and device, electronic equipment and computer readable storage medium
CN111193795B (en) * 2019-12-30 2021-07-02 腾讯科技(深圳)有限公司 Information pushing method and device, electronic equipment and computer readable storage medium
CN113110979A (en) * 2020-01-13 2021-07-13 中国移动通信集团浙江有限公司 Application program optimization evaluation method and device, computing equipment and storage medium
CN113110979B (en) * 2020-01-13 2023-04-07 中国移动通信集团浙江有限公司 Application program optimization evaluation method and device, computing equipment and storage medium

Also Published As

Publication number Publication date
AU2001251281A1 (en) 2001-10-15

Similar Documents

Publication Publication Date Title
US6405192B1 (en) Navigation assistant-method and apparatus for providing user configured complementary information for data browsing in a viewer context
US7167875B2 (en) Web page thumbnails and user configured complementary information provided from a server
US7031968B2 (en) Method and apparatus for providing web site preview information
US6049342A (en) System and method for controlling the downloading and display of an animated sequence from a network
US6055542A (en) System and method for displaying the contents of a web page based on a user&#39;s interests
US6560620B1 (en) Hierarchical document comparison system and method
USRE42413E1 (en) Web search engine with graphic snapshots
US5978828A (en) URL bookmark update notification of page content or location changes
RU2245577C2 (en) Electronic message board and mail server
US7756849B2 (en) Method of searching for text in browser frames
US20040148568A1 (en) Checker and fixer algorithms for accessibility standards
US20010014895A1 (en) Method and apparatus for dynamic software customization
KR20140009483A (en) Message catalogs for remote modules
CN101601033A (en) Generate the Search Results of specialty in response to the medelling inquiry
WO2006053264A1 (en) Active abstracts
JP4753755B2 (en) Data conversion method, apparatus and program
WO2001075678A1 (en) System and method for evaluating an optimizing web site attributes
US7219298B2 (en) Method, system, and program for verifying network addresses included in a file
US20040225672A1 (en) Method for editing a web site
WO2005121982A1 (en) Information providing system, method, program, information communication terminal, and information display switching program
JP5031137B2 (en) Information processing apparatus and control method thereof
US20060184655A1 (en) Traffic analysis
US20110320923A1 (en) Method, Apparatus, Processor-Readable Media and Signals for Acquiring and Presenting Acquired Multimedia Content
GB2326746A (en) Displaying a subset of a list of titles
US7644358B2 (en) Usability of a portal application

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: COMMUNICATION PURSUANT TO RULE 69 EPC (EPO FORM 1205A OF 250303)

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: JP