[FoRK] Text trees: semi-awesome

J. Andrew Rogers <andrew at ceruleansystems.com> on Thu Feb 14 10:44:05 PST 2008

On Feb 14, 2008, at 8:21 AM, Jeff Bone wrote:
> Going to have to dig in and figure out how the clustering occurs,  
> but it seems to me that this kind of destructuring topological sort  
> might be a very, very useful tool in advancing the state-of-the-art  
> in natural language understanding by machines.  (Anybody know if  
> this kind of thing's already being used for that sort of purpose?


It is suffix tree clustering, and has been around in various forms  
since the 1970s.  From the standpoint of natural language processing,  
it is essentially equivalent to text compression methods (IIRC  
something like suffix trees are used in the Burrows-Wheeler transform)  
with all the capabilities and limitations implied.

Cheers,

J. Andrew Rogers



More information about the FoRK mailing list