'Statistical Approach' Category
Discussion of Joachims (SVMs)
This week, if you remember, we are discussing Text Categorization with Support Vector Machines: Learning with Many Relevant Features – Joachims (ResearchIndex), one of the most cited papers that a Google Scholar search for Text Categorization returns. Text Categorization is the problem of assigning one or more predefined [...]
Discussion of Sections 5-7 of Yang 97
Whew, I think we’ve made it through our first paper, or we are about to anyway. If you recall, we are working our way through Yang 97 and had made it through the first 4 sections so far, which are covered here. This leaves us with the meat of the paper, I guess, which is [...]
Discussion of Sections 1-4 of Yang 97
So, hopefully everyone has read the paper (http://www.paperoftheweek.com/2007/01/08/an-evaluation-of-statistical-approaches-to-text-categorization-yang-researchindex/) at least once. The first 4 sections are quite easy to follow, in my opinion, as they define the problem of text categorization and lay out the framework for the experiments. Digging into the details of the various implementations will be left as an exercise for the reader [...]
An Evaluation of Statistical Approaches to Text Categorization – Yang (ResearchIndex)
An Evaluation of Statistical Approaches to Text Categorization – Yang (ResearchIndex) is the first paper of the week that I am going to tackle. Finding a first paper was harder than I thought. I originally intended to start with some of the seminal papers in IR, such as Salton’s work, or Spärck Jones’s, but it seems [...]