US 6,983,282 B2 | ||
Computer method and apparatus for collecting people and organization information from Web sites | ||
Jonathan Stern, Newton, Mass. (US); Kosmas Karadimitriou, Shrewsbury, Mass. (US); Jeremy W. Rothman-Shore, Cambridge, Mass. (US); and Michel Decary, Montreal (Canada) | ||
Assigned to Zoom Information, Inc., Cambridge, Mass. (US) | ||
Filed on Mar. 30, 2001, as Appl. No. 9/821,908. | ||
Claims priority of provisional application 60/221750, filed on Jul. 31, 2000. | ||
Prior Publication US 2002/0052928 A1, May 02, 2002 | ||
Int. Cl. G06F 17/30 (2006.01) |
U.S. Cl. 707—102 | 25 Claims |
1. A method for collecting people and organization information from Web sites in a global computer network comprising the
steps of:
accessing a Web site of potential interest, the Web site having a plurality of Web pages;
determining a subset of the plurality of Web pages to process; and
for each Web page in the subset, (i) determining types of contents found on the Web page, and (ii) based on the determined
content types, enabling extraction of people and organization information from the Web page.
|