'Computer Science' Category
POTW 5/14/07: Discussion of “Discovering Trends in Text Databases” by Lent et. al.
This week’s paper, “Discovering Trends in Text Databases” by Lent is my first look at some text mining tools and applications. The paper discusses a method for identifying trends in databases. In this case, a trend is defined as “a specific subsequence of the history of a phrase that satisfies the users’ query over the [...]
POTW 5/14/07: “Discovering Trends in Text Databases” by Lent et. al.
Ah, good to be back! This week’s paper is “Discovering Trends in Text Databases” by Brian Lent, Rakesh Agrawal and Ramakrishnan Srikant.
POTW to return next week
I will be returning to writing next week after a great ApacheCon Europe conference last week. My “Advanced Lucene” slides are available at http://www.cnlp.org/presentations/present.asp?show=conference Next week, I think I am going to start looking into things like event detection, etc. However, I am also considering looking into some non-NLP areas related to data mining, so [...]
Some papers on NLP
Latent Semantic Analysis http://lsi.argreenhouse.com/lsi/papers/JASIS90.pdf You are all no doubt familiar with Google and the way in which web pages are found using keywords. Unfortunately this method can break down. If for example you want to find references to the game of “bridge” you are (in English) swamped with references to civil engineering. I even suggested [...]
Guest Contributor: Ian Parker
I am pleased to announce I’ve found a guest contributor for the next few weeks. His name is Ian Parker and he blogs at http://ipai.blogspot.com/ Here is a brief bio from Ian: I am a retired scientist. I gained my PhD in theoretical Solid State Physics from the University of Sussex (England) in 1969. I [...]