Archive for April, 2007
Some papers on NLP
Latent Semantic Analysis
http://lsi.argreenhouse.com/lsi/papers/JASIS90.pdf
You are all no doubt familiar with Google and the way in which web pages are found using keywords. Unfortunately this method can break down. If for example you want to find references to the game of “bridge” you are (in English) swamped with references to civil engineering. I even suggested to the [...]Popularity: 3% [?]
Guest Contributor: Ian Parker
I am pleased to announce I’ve found a guest contributor for the next few weeks. His name is Ian Parker and he blogs at http://ipai.blogspot.com/
Here is a brief bio from Ian:
I am a retired scientist. I gained my PhD in theoretical Solid State Physics from the University of Sussex (England) in 1969. I have worked [...]Popularity: 5% [?]
Guest Contributor wanted for next 3 weeks
If you have an interest in writing on artificial intelligence, clustering, information retrieval or computer science in general and are interested in reviewing one or more articles over the coming three weeks on this forum, please contact me by leaving a comment on this post. All topics will be subject to my review for appropriateness, [...]
Popularity: 10% [?]
POTW 4/8/07: “Scatter/Gather: A Cluster-based Approach to Browsing Large Document Collections” by Cutting, et.al
This week’s paper is Scatter/Gather: A Cluster-based Approach to Browsing Large Document Collections by Cutting, et. al. This paper, from 1992, takes clustering out of the realm of search, as it was previously used, albeit indifferently, and proposes to use it in a document browsing scenario. In doing so, the authors propose a [...]
Popularity: 4% [?]
POTW: 4/8/07: “Scatter/Gather: A Cluster-based Approach to Browsing Large Document Collections” by Cutting, et.al
This week’s paper is “Scatter/Gather: A Cluster-based Approach to Browsing Large Document Collections” by Cutting, Karger, Pedersen and Tukey. This is one of Doug Cutting’s older works on clustering, pre Lucene fame.
Popularity: 4% [?]Popularity: 4% [?]

