'Information Retrieval' Category
POTW 6/11/07: Discussion of “A Sequential Algorithm for Training Text Classifiers” by Lewis and Gale
In “A Sequential Algorithm for Training Text Classifiers” by David D.
Lewis and William Gale, the authors put forth a new (at the time)
method training text classifiers using an approach they call
“uncertainty sampling”
Section 1 outlines the problem of training, namely obtaining a good
sample of text to be labeled for the trainer. After disposing of
several other methods [...]Popularity: 21% [?]
Google’s initiatives in Artificial Intelligence
Introduction
Google’s earnings nearly doubled last year.
http://news.com.com/Google+profit+nearly+doubles/2100-1030_3-6127658.html
Unlike Microsoft that gets its money from shifting boxes Google relies on advertising to pay its way. There is a tremendous incentive to improve the quality of searching. The first reason is obvious. The better Google is perceived to perform as a search engine, the more people will use Google [...]Popularity: 100% [?]
POTW 5/21/07: Discussion of “A Study on Retrospective and On-Line Event Detection” by Yang, Pierce and Carbonell
Yang’s paper on on-line event detection (”A Study on Retrospective and On-Line Event Detection“) discusses the use of common text retrieval techniques to automatically detect events in news streams.
Imagine that you are responsible for monitoring all the major news feeds in every single country your company does business in order to advise the CEO on [...]Popularity: 12% [?]
Guest Contributor wanted for next 3 weeks
If you have an interest in writing on artificial intelligence, clustering, information retrieval or computer science in general and are interested in reviewing one or more articles over the coming three weeks on this forum, please contact me by leaving a comment on this post. All topics will be subject to my review for appropriateness, [...]
Popularity: 10% [?]
POTW 4/8/07: “Scatter/Gather: A Cluster-based Approach to Browsing Large Document Collections” by Cutting, et.al
This week’s paper is Scatter/Gather: A Cluster-based Approach to Browsing Large Document Collections by Cutting, et. al. This paper, from 1992, takes clustering out of the realm of search, as it was previously used, albeit indifferently, and proposes to use it in a document browsing scenario. In doing so, the authors propose a [...]
Popularity: 4% [?]

