WO2000050972A2 - Network meta-information collection tool - Google Patents
Network meta-information collection tool
- Publication number
- WO2000050972A2 (application PCT/US2000/005008)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- information
- selection
- received
- web
- page
- Prior art date
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/954—Navigation, e.g. using categorised browsing
Definitions
- This invention relates to information collection, more particularly to techniques for collecting information from different networked computers.
- the Internet's World-Wide-Web offers users easy access to a tremendous amount of information on a wide range of subjects.
- Search engines such as Yahoo™ can provide lengthy lists of sites related to almost any topic of interest.
- formulating meaningful queries can prove difficult to both new and experienced search engine users.
- visiting the tremendous number of web-sites typically retrieved by a search engine can make Internet-based research time-consuming and frustrating.
- Search engines also frequently list Internet sites that may be too advanced for users investigating a topic for the first time. For example, a junior high school student investigating evolution can easily find themselves viewing a web-page that presents an involved Ph.D. thesis instead of a page that presents material in an introductory manner. Beyond finding appropriate information, collecting information from the web requires careful organization of stored web content for later use.
- a method of collecting information received from an Internet or Intranet URL includes receiving a user selection from a web-page corresponding to the URL and analyzing information included in the web-page to determine citation information for the user selection. The method further includes storing and displaying the selection and the determined citation information.
- URL: Uniform Resource Locator
- a method of collecting information received from a networked computer includes receiving a user selection from displayed information received from a networked computer and analyzing the information received from the network computer to determine at least one attribute of the user selection. The method also includes storing the selection and the determined attribute(s).
- the attribute may be the authorship of the received information.
- the analyzing may include examining tags (e.g., HTML <META> tags) included in the received information.
- the received information may be a web-page from an Internet or Intranet web-site.
- the user selection may be dragged-and-dropped, cut-and-pasted, or transferred to a storage medium (e.g., a user's hard disk).
- the user selection may include text, graphics, sounds, animation, video, etc.
- the method may further include displaying the stored selection and the determined attributes, for example, by dynamically generating HTML pages that include the stored selection and determined attributes.
- the method may include navigating through a hierarchically organized grouping of topics to request the information received from the networked computer.
- a file may include information that describes the hierarchically organized grouping of topics.
- a tool for collecting information received from a networked computer includes means for receiving a user selection from displayed information received from a networked computer, means for analyzing the information received from the network computer to determine at least one attribute of the user selection, and means for storing the selection and the determined attribute(s).
- the tool may also include means for maintaining the tool in the foreground during display of the received information.
- Advantages may include one or more of the following.
- the methods, programs, and tools ease network-based research by speeding information location and collection and easing organization of the collected material. By automatically collecting and storing information needed for citations from tags and other information included in a web-page, users can clip information and provide citations without having to perform their own research to determine authorship, publication date, etc. By providing a persistent window in the foreground to collect user clippings, a user can quickly and easily access clipping functions.
- FIG. 1 is a screenshot of a tool for collecting web-page clippings.
- FIG. 2 is a listing of a file for producing a directory of web-pages hierarchically organized by topic.
- FIGS. 3A and 3B are screenshots illustrating dragging-and-dropping of a file onto the tool.
- FIG. 4 is a screenshot of a user-selected clipping.
- FIG. 5 is a screenshot of notes that a user can enter.
- FIG. 6 is a screenshot of a screen including saved clippings and corresponding citation information.
- FIGS. 7A-7C are screenshots of a different user interface for the tool.
- FIG. 8 is a flowchart of a process for clipping information from different web-pages.
- FIG. 9 is a block diagram of a computer.
- the inventors have designed a tool that works with a user's browser to ease Internet/Intranet-based research.
- a user can quickly find web-pages that include information pertinent to the user, collect information from the web-pages, and organize the collected material. Additionally, the tool can automatically collect and store information needed for citations to Internet material.
- a browser screen 100 displays the graphics and text 102 of a URL (Uniform Resource Locator) 104.
- a URL 104 identifies a resource (e.g., a file of HTML (HyperText Markup Language) instructions and data) at a remote network site. HTML instructions describe an arrangement of text, graphics, animation, video, sound, and other data provided by the resource.
- HTML files are delivered to the user's computer.
- a browser (e.g., Netscape™ Navigator™, Microsoft™ Internet Explorer™, or Mosaic™) constructs a display based on the received information. As shown, the browser is an independent application; however, browser functions can be incorporated into other applications. Additionally, many browsers can process formats other than HTML (e.g., XML).
- FIG. 1 also shows a clipping tool 200 superimposed over the browser screen 100.
- the clipping tool 200 provides different functions for finding research material on the Internet and saving selections from the material (i.e., "clippings") for later use. These functions include a "Select" function that enables a user to quickly find a web-page of interest from a group of pre-selected sites; a "Collect" function that enables a user to save a clipping from a web-page; an "Edit" function that enables a user to enter notes about the material; and a "List" function that enables a user to manage and view clipped material.
- the tool 200 presents these functions in a window that includes a tabbed GUI (graphical user interface) control that permits quick and intuitive access to the different functions.
- GUI: graphical user interface
- the clipping tool 200 remains completely visible regardless of whether the tool 200 or the browser 100 is active (i.e., is currently the target of user input). Thus, a user always has easy access to tool 200 functions without using operating system commands to bring the tool 200 to the foreground.
- the tool 200 and browser 100 can communicate in a variety of ways.
- a Microsoft™ Internet Explorer™ browser object can be instantiated by programming languages such as Microsoft™ Visual Basic™.
- the tool 200 can control the behavior of the browser 100 using public methods and properties of the instantiated browser object.
- in another implementation, the tool 200 is a Java applet that includes instructions executed by a browser's Java interpreter. A wide variety of other implementations are possible.
- the "Select" function presents a directory 202 of different subject areas.
- the directory 202 organizes these subjects using a hierarchy of directory nodes. At the "bottom" of the hierarchy are nodes that refer to specific URLs.
- the system can provide a preview (e.g., a thumbnail or description) (not shown) of the corresponding URL.
- when a user selects (e.g., clicks a mouse on) a URL node 204d, the tool 200 causes the browser 100 to download information from the URL associated with the node.
- the user has selected the "Metazoa: Animals, Animals, Animals" node 204d.
- the tool 200 can build the directory 202 from a file 208 that identifies the different nodes in the directory. For nodes corresponding to specific web-pages, the file 208 can include both the URL 208 and any other information that might be needed for a citation to the web-page such as authorship 206 information.
- Different files can include different directory information describing different hierarchies of nodes.
- one directory file may include directory information suitable for high school students while another may include information for professional researchers. This enables researchers of different ages and having different research requirements to access suitable material.
- a user can specify which directory 208 the tool 200 displays by dragging-and-dropping a directory file 208 onto the tool 200.
- the tool 200 interprets the file 208 contents to construct a directory.
- the tool 200 can present a GUI control that enables a user to navigate through different system folders and documents in search of a particular directory file.
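The construction of a directory from a file can be sketched as follows. This is only an illustration under stated assumptions: the JSON layout, the field names (`title`, `children`, `url`, `author`), and the helpers `load_directory` and `leaf_urls` are hypothetical, and the actual file format of FIG. 2 is not reproduced here.

```python
import json
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class DirectoryNode:
    """One node in the topic hierarchy; only leaf nodes carry a URL."""
    title: str
    url: Optional[str] = None       # set only on leaf (URL) nodes
    author: Optional[str] = None    # citation data stored alongside the URL
    children: List["DirectoryNode"] = field(default_factory=list)


def build_node(raw: dict) -> DirectoryNode:
    """Recursively convert a parsed JSON object into a DirectoryNode."""
    return DirectoryNode(
        title=raw["title"],
        url=raw.get("url"),
        author=raw.get("author"),
        children=[build_node(child) for child in raw.get("children", [])],
    )


def load_directory(text: str) -> DirectoryNode:
    """Parse the contents of a (hypothetical) JSON directory file."""
    return build_node(json.loads(text))


def leaf_urls(node: DirectoryNode) -> List[str]:
    """Collect the URLs at the bottom of the hierarchy, in file order."""
    if node.url is not None:
        return [node.url]
    urls: List[str] = []
    for child in node.children:
        urls.extend(leaf_urls(child))
    return urls


# Assumed sample content; the URL and author are placeholders.
SAMPLE = """
{"title": "Science", "children": [
  {"title": "Biology", "children": [
    {"title": "Metazoa: Animals, Animals, Animals",
     "url": "http://example.org/metazoa", "author": "Example Author"}
  ]}
]}
"""

root = load_directory(SAMPLE)
```

Swapping in a different file yields a different hierarchy, which is how one tool could serve both high school students and professional researchers.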
- as shown in FIG. 4, after a user accesses a URL, either by using the directory shown in FIG. 1 or by using more traditional browser techniques (e.g., directly entering a URL or selecting a link), the tool 200 allows the user to create or add to a list of web-page clippings.
- the tool's 200 "Collect" function presents a control 210 that accepts text, graphics, and other information selected from a web-page.
- a user has selected a paragraph 212 discussing animal morphology.
- a user can "drag" (e.g., moving the mouse while depressing a mouse button) the selection 212 from the web-page and drop the selection (e.g., releasing the mouse button) onto the tool 200.
- the user can copy the selection for later pasting into the tool.
- the tool 200 stores the selected information.
- the tool 200 can also analyze the source of the selection (e.g., the web-page originally received from a networked computer) for attributes of the selected information.
- the tool 200 can also examine text included in the web-page to extract information. For example, the tool 200 can look for text that includes words such as "author" and analyze the text that follows to see if it corresponds to
- the tool 200 can query the browser 100 for information such as the URL currently being displayed.
- the tool 200 can also use system functions to determine other information such as the time a URL was loaded into memory. Additionally, if the user reached the URL via the directory file, the tool 200 can retrieve authorship information stored in the directory file (see FIG. 2).
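The analysis of tags and page text described above might look like the following sketch, which uses Python's standard `html.parser`. The class name `MetaTagCollector`, the function `extract_citation`, the returned field names, and the "Author:" text pattern are assumptions made for illustration, not the tool's actual implementation.

```python
import re
from html.parser import HTMLParser


class MetaTagCollector(HTMLParser):
    """Collects name/content pairs from <META> tags and the page <TITLE>."""

    def __init__(self):
        super().__init__()
        self.meta = {}
        self.title = ""
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        attrs = dict(attrs)
        if tag == "meta" and attrs.get("name") and attrs.get("content"):
            self.meta[attrs["name"].lower()] = attrs["content"]
        elif tag == "title":
            self._in_title = True

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def extract_citation(html, url):
    """Return citation attributes found in the page (hypothetical fields)."""
    collector = MetaTagCollector()
    collector.feed(html)
    author = collector.meta.get("author")
    if author is None:
        # Fallback: look for visible text such as "Author: Jane Doe".
        match = re.search(r"author\s*:\s*([^<\n]+)", html, re.IGNORECASE)
        if match:
            author = match.group(1).strip()
    return {"url": url, "page_name": collector.title.strip(), "author": author}
```

For example, `extract_citation(page_html, current_url)` would be called with the page source and the URL reported by the browser; a page with neither an author tag nor matching text yields `author` set to `None`.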
- the tool 200 automatically initiates its "Edit" function to enable a user to enter notes 214 describing the clipping. These notes 214 enable a user to preserve thoughts about the clipping such as how the clipping might be incorporated into a paper and ideas for subsequent research.
- the tool 200 also generates a default title 212 for the clipping (e.g., the first few words of selected text). As shown, the tool 200 can also display a thumbnail 213 of any selected graphics.
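As a rough sketch of a stored clipping with a default title taken from the first few words of the selection: the `Clipping` record, its field names, and the five-word cutoff are assumptions chosen for illustration.

```python
from dataclasses import dataclass, field
from datetime import datetime


def default_title(text, max_words=5):
    """Use the first few words of the selected text as a title."""
    words = text.split()
    title = " ".join(words[:max_words])
    # Mark truncation only when words were actually dropped.
    return title + "..." if len(words) > max_words else title


@dataclass
class Clipping:
    """A saved selection together with the user's notes."""
    content: str
    notes: str = ""
    title: str = ""
    clipped_at: datetime = field(default_factory=datetime.now)

    def __post_init__(self):
        # Fall back to a generated title when the user supplies none.
        if not self.title:
            self.title = default_title(self.content)
```

A user-supplied title is kept as-is; only an empty title triggers the default.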
- the tool's 200 "List" function lists different collected clippings 220. Selecting the tool's 200 "View" button 221 causes the tool 200 to dynamically prepare HTML instructions for a display 223 that includes the web-page clippings 212 and the collected citation information 224-238.
- citation information can include when the material was clipped 226, the type of material 228 (e.g., text or graphics), the name of the page 230, and its URL 232.
- the information can further include the site name 234, authors 236, and site URL 238.
- the display 223 further includes any notes the user may have entered for the clipping.
- the URLs 232, 238 can be displayed as
- any number of different user interfaces can provide functions described above.
- as shown in FIG. 7A, the tool's 200 "Search" function uses pull-down menus that enable a user to navigate through a directory 204 to view the content of different network sites. As shown in FIGS. 7B and 7C, the tool's 200 "Collect" function provides a thumbnail 213 image of selected content and an area to enter notes 214.
- a user can navigate to a web-page either using
- the tool can analyze the web-page's contents (e.g., HTML instructions and text) to determine citation information 808. Additionally, the tool can use information stored in the
- the selected clipping and citation information are saved 810. Thereafter, the tool can dynamically produce an HTML instruction file that includes the clippings.
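Dynamically producing an HTML file from saved clippings could be sketched as below. The dictionary keys (`title`, `content`, `clipped`, `type`, `page_name`, `url`, `author`, `notes`) and the `render_clippings` function are hypothetical names; the page layout is likewise an assumption rather than the display of FIG. 6.

```python
from html import escape


def render_clippings(clippings):
    """Return an HTML page listing each clipping with its citation data."""
    parts = ["<html><body>"]
    for clip in clippings:
        # Escape everything so clipped markup is shown, not interpreted.
        parts.append("<h2>%s</h2>" % escape(clip["title"]))
        parts.append("<blockquote>%s</blockquote>" % escape(clip["content"]))
        parts.append("<ul>")
        for label in ("clipped", "type", "page_name", "url", "author"):
            value = clip.get(label)
            if value:
                parts.append("<li>%s: %s</li>" % (label, escape(value)))
        parts.append("</ul>")
        if clip.get("notes"):
            parts.append("<p>Notes: %s</p>" % escape(clip["notes"]))
    parts.append("</body></html>")
    return "\n".join(parts)
```

The returned string can be written to a file and opened in the browser; citation fields that were not determined are simply omitted from the list.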
- a computer platform 900 includes a monitor 902, a keyboard 904, a pointing device 906 such as a mouse, and a digital computer
- the digital computer 910 includes memory 912, a processor 914, a mass storage device 916, and other customary components such as a memory bus and peripheral bus (not shown).
- the platform 900 further includes a network connection 908.
- Mass storage device 916 can store instructions for an operating system (e.g., Microsoft™ Windows™), a browser, and the tool 200. The instructions may be transferred to memory 912 and processor 914 in the course of operation. Of course, instructions for the tool 200 may be stored on a variety of mass storage devices such as a floppy disk, CD-ROM, or PROM (not shown).
- the techniques described here are not limited to any particular hardware or software configuration; they may find applicability in any computing or processing environment that may be used for collecting information from networked computers.
- the techniques may be implemented in hardware or software, or a combination of the two.
- the techniques are implemented in computer programs executing on programmable computers that each include a processor, a storage medium readable by the processor (including volatile and non-volatile memory and/or storage elements), at least one input device, and one or more output devices.
- Program code is applied to data entered using the input device to perform the functions described and to generate output information.
- the output information is applied to one or more output devices.
- Each program is preferably implemented in a high level procedural or object oriented programming language to communicate with a computer system; however, the programs can be implemented in assembly or machine language, if desired. In any case, the language may be a compiled or interpreted language.
- Each such computer program is preferably stored on a storage medium or device (e.g., CD-ROM, hard disk or magnetic diskette) that is readable by a general or special purpose programmable computer for configuring and operating the computer when the storage medium or device is read by the computer to perform the procedures described in this document.
- the system may also be considered to be implemented as a computer-readable storage medium, configured with a computer program, where the storage medium so configured causes a computer to operate in a specific and predefined manner.
- Other embodiments are within the scope of the following claims. What is claimed is:
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU40045/00A AU4004500A (en) | 1999-02-26 | 2000-02-25 | Network information collection tool |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US25939599A | 1999-02-26 | 1999-02-26 | |
US09/259,395 | 1999-02-26 |
Publications (3)
Publication Number | Publication Date |
---|---|
WO2000050972A2 (en) | 2000-08-31 |
WO2000050972A3 (en) | 2000-12-28 |
WO2000050972A9 (en) | 2001-04-19 |
Family
ID=22984767
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2000/005008 WO2000050972A2 (en) | 1999-02-26 | 2000-02-25 | Network meta-information collection tool |
Country Status (2)
Country | Link |
---|---|
AU (1) | AU4004500A (en) |
WO (1) | WO2000050972A2 (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2002021305A2 (en) * | 2000-09-05 | 2002-03-14 | Edgardo Di Nicola Carena | A system and a method to access and organise information available from a network |
US6366923B1 (en) * | 1998-03-23 | 2002-04-02 | Webivore Research, Llc | Gathering selected information from the world wide web |
WO2003079133A3 (en) * | 2002-03-15 | 2004-02-05 | Koninkl Philips Electronics Nv | Previewing documents on a computer system |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1996023265A1 (en) * | 1995-01-23 | 1996-08-01 | British Telecommunications Public Limited Company | Methods and/or systems for accessing information |
US5862325A (en) * | 1996-02-29 | 1999-01-19 | Intermind Corporation | Computer-based communication system and method using metadata defining a control structure |
US5931907A (en) * | 1996-01-23 | 1999-08-03 | British Telecommunications Public Limited Company | Software agent for comparing locally accessible keywords with meta-information and having pointers associated with distributed information |
-
2000
- 2000-02-25 WO PCT/US2000/005008 patent/WO2000050972A2/en active Application Filing
- 2000-02-25 AU AU40045/00A patent/AU4004500A/en not_active Abandoned
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1996023265A1 (en) * | 1995-01-23 | 1996-08-01 | British Telecommunications Public Limited Company | Methods and/or systems for accessing information |
US5931907A (en) * | 1996-01-23 | 1999-08-03 | British Telecommunications Public Limited Company | Software agent for comparing locally accessible keywords with meta-information and having pointers associated with distributed information |
US5862325A (en) * | 1996-02-29 | 1999-01-19 | Intermind Corporation | Computer-based communication system and method using metadata defining a control structure |
Non-Patent Citations (1)
Title |
---|
GRAHAM, I, HTML Sourcebook, February 1997, pages 156-159, XP002931179. * |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6366923B1 (en) * | 1998-03-23 | 2002-04-02 | Webivore Research, Llc | Gathering selected information from the world wide web |
WO2002021305A2 (en) * | 2000-09-05 | 2002-03-14 | Edgardo Di Nicola Carena | A system and a method to access and organise information available from a network |
WO2002021305A3 (en) * | 2000-09-05 | 2003-02-27 | Nicola Carena Edgardo Di | A system and a method to access and organise information available from a network |
WO2003079133A3 (en) * | 2002-03-15 | 2004-02-05 | Koninkl Philips Electronics Nv | Previewing documents on a computer system |
Also Published As
Publication number | Publication date |
---|---|
AU4004500A (en) | 2000-09-14 |
WO2000050972A3 (en) | 2000-12-28 |
WO2000050972A9 (en) | 2001-04-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6310630B1 (en) | Data processing system and method for internet browser history generation | |
US6832350B1 (en) | Organizing and categorizing hypertext document bookmarks by mutual affinity based on predetermined affinity criteria | |
US6597377B1 (en) | Web links objects | |
US6366923B1 (en) | Gathering selected information from the world wide web | |
US20180293307A1 (en) | User driven computerized selection, categorization, and layout of live content components | |
US7188319B2 (en) | Displaying graphical information and user selected properties on a computer interface | |
US6023701A (en) | Skeleton page retrieval mode for web navigation | |
US7631255B2 (en) | Interface and method for exploring a collection of data | |
US6233591B1 (en) | Dropping hyperlink onto document object | |
US6667751B1 (en) | Linear web browser history viewer | |
US5978828A (en) | URL bookmark update notification of page content or location changes | |
US6100890A (en) | Automatic bookmarks | |
CA2333372C (en) | Multi-target links for navigating between hypertext documents and the like | |
US7962843B2 (en) | Browser session overview | |
US7315848B2 (en) | Web snippets capture, storage and retrieval system and method | |
US7406664B1 (en) | System for integrating HTML Web site views into application file dialogs | |
US20060155728A1 (en) | Browser application and search engine integration | |
US20060041589A1 (en) | System and method for clipping, repurposing, and augmenting document content | |
US20140052778A1 (en) | Method and apparatus for mapping a site on a wide area network | |
US20070220415A1 (en) | Excel spreadsheet parsing to share cells, formulas, tables or entire spreadsheets across an enterprise with other users | |
US8826112B2 (en) | Navigating table data with mouse actions | |
US20080294978A1 (en) | Semantic navigation through web content and collections of documents | |
WO2007134894A1 (en) | A method of bookmarking in an internet browser | |
EP1974318A2 (en) | System, process and software arrangement for assisting in navigating the internet | |
US20050240489A1 (en) | Retaining custom item order |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| AK | Designated states | Kind code of ref document: A2; Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW |
| AL | Designated countries for regional patents | Kind code of ref document: A2; Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application | |
| DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | |
| AK | Designated states | Kind code of ref document: A3; Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW |
| AL | Designated countries for regional patents | Kind code of ref document: A3; Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
| AK | Designated states | Kind code of ref document: C2; Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW |
| AL | Designated countries for regional patents | Kind code of ref document: C2; Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
| COP | Corrected version of pamphlet | Free format text: PAGES 1/11-11/11, DRAWINGS, REPLACED BY NEW PAGES 1/11-11/11 |
| REG | Reference to national code | Ref country code: DE; Ref legal event code: 8642 |
| 32PN | Ep: public notification in the ep bulletin as address of the addressee cannot be established | Free format text: NOTING OF LOSS OF RIGHTS (R.69(1) EPC) SENT ON 061202 |
| 122 | Ep: pct application non-entry in european phase | |
122 | Ep: pct application non-entry in european phase |