08402026 is referenced by 13 patents and cites 199 patents.

A system and method for efficiently generating cluster groupings in a multi-dimensional concept space is described. A plurality of terms is extracted from each document in a collection of stored unstructured documents. A concept space is built over the document collection. Terms substantially correlated between a plurality of documents within the document collection are identified. Each correlated term is expressed as a vector mapped along an angle θ originating from a common axis in the concept space. A difference between the angle θ for each document and an angle σ for each cluster within the concept space is determined. Each such cluster is populated with those documents having such difference between the angle θ for each such document and the angle σ for each such cluster falling within a predetermined variance. A new cluster is created within the concept space those documents having such difference between the angle θ for each such document and the angle σ for each such cluster falling outside the predetermined variance.

Title
System and method for efficiently generating cluster groupings in a multi-dimensional concept space
Application Number
10/911376
Publication Number
8402026 (B2)
Application Date
August 3, 2004
Publication Date
March 19, 2013
Inventor
Dan Gallivan
Bainbridge Island
WA, US
Agent
Krista A Wittman
Patrick J S Inouye
Assignee
FTI Technology
MD, US
IPC
G06F 17/30
G06F 7/00
View Original Source