<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Paper of the Week &#187; Question Answering</title>
	<atom:link href="http://www.paperoftheweek.com/category/computer-science/question-answering/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.paperoftheweek.com</link>
	<description>Read. Learn. Discuss.</description>
	<lastBuildDate>Tue, 14 Aug 2007 01:35:31 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>Google&#8217;s initiatives in Artificial Intelligence</title>
		<link>http://www.paperoftheweek.com/2007/06/17/googles-initiatives-in-artificial-intelligence/</link>
		<comments>http://www.paperoftheweek.com/2007/06/17/googles-initiatives-in-artificial-intelligence/#comments</comments>
		<pubDate>Sun, 17 Jun 2007 11:42:03 +0000</pubDate>
		<dc:creator>Ian Parker</dc:creator>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Computer Science]]></category>
		<category><![CDATA[Information Retrieval]]></category>
		<category><![CDATA[Natural Language Processing (NLP)]]></category>
		<category><![CDATA[Question Answering]]></category>
		<category><![CDATA[Text Categorization]]></category>

		<guid isPermaLink="false">http://www.paperoftheweek.com/2007/06/17/googles-initiatives-in-artificial-intelligence/</guid>
		<description><![CDATA[Introduction Google&#8217;s earnings nearly doubled last year. http://news.com.com/Google+profit+nearly+doubles/2100-1030_3-6127658.html Unlike Microsoft that gets its money from shifting boxes Google relies on advertising to pay its way. There is a tremendous incentive to improve the quality of searching. The first reason is obvious. The better Google is perceived to perform as a search engine, the more people [...]]]></description>
			<content:encoded><![CDATA[<p><strong>Introduction</strong><br />
Google&#8217;s earnings nearly doubled last year.<br />
http://<a href="http://news.com.com/Google+profit+nearly+doubles/2100-1030_3-6127658.html">news.com.com/Google+profit+nearly+doubles/2100-1030_3-6127658.html</a></p>
<p>Unlike Microsoft that gets its money from shifting boxes Google relies on advertising to pay its way. There is a tremendous incentive to improve the quality of searching. The first reason is obvious. The better Google is perceived to perform as a search engine, the more people will use Google for their searches and the greater the traffic for advertisers. The second reason is a little bit more sinister. Google gets paid according to the number of clicks made on an advertisement. Google as well as telling you the results of your search needs also to put some ads your way. The share price of Google is closely linked to the perceived quality of search.</p>
<p><strong>The quest for AI</strong><br />
As one might expect Google is deeply into AI. AI one might argue is essentially what the core business of Google depends on. Suppose we can take a web page, find out exactly what it is about, extract all the relevant facts and put them into a database, then on the prompting of a query from a user marshal all the facts that are relevant to that enquiry. This is what an AI system looking at web pages would essentially do. </p>
<p><a href="http://news.com.com/2100-11395_3-6160372.html">http://news.com.com/2100-11395_3-6160372.html</a><br />
Google is talking about the size of the human genome and the size of AI. I think the arguments are a little bit misleading. I would prefer to look at what we would expect from AI. Suppose I were to show you a box and I told you that that box was &#8220;<em>intelligent</em>&#8220;. What would you expect. Well Alan Turing devised what is now known as the Turing Test. He said that if the response of a computer to a conversation was indistinguishable from that of a human, it had passed the Turing Test. </p>
<p>On the subject of the Turing Test, Alan Turing envisaged a test which would distinguish between men and women and also would be psychic. Turing believed in ESP. Looking at Alice I am aghast, whenever I say something she always changes the subject. Hardly surprising in view of the Spanish! (<em>La estacion de resorte &#8211; El barco attravesta una cerradura</em>)</p>
<p>In other words I would expect to be able to ask questions and get an intelligible response. I could engage in a conversation if I wanted greater depth. If the box claimed to speak Spanish I would expect translations which showed an understanding of context. In fact it could not produce an intelligible response without context. We would also want the answer to statistical questions, like how do people like BMW cars? What is the correlation between this and that? Can we deduce anything about cancer from the people who get it their lifestyles etc?</p>
<p>We would also like to see some evidence of reasoning ability. Google is not committed specifically to reasoning. In a sense reasoning comes after the ability to retrieve efficiently. This has been discussed by myself and other people in &#8220;Creating Artificial Intelligence&#8221;<br />
<a href="http://groups.google.co.uk/group/creatingAI?hl=en">http://groups.google.co.uk/group/creatingAI?hl=en</a><br />
I have also written the following blogs.</p>
<p><a href="http://ipai.blogspot.com/">http://ipai.blogspot.com/</a><br />
<a href="http://ipai1.blogspot.com/">http://ipai1.blogspot.com/</a><br />
<a href="http://ipai2.blogspot.com/">http://ipai2.blogspot.com/</a></p>
<p>One thing to remember and that is that the ability to find facts is closely related to the ability to automatically construct wrappers. This is one of the main features of Web 3.0.</p>
<p>Let us now return to Google and what they are doing to produce a Web based AI</p>
<p><strong>Searching &#8211; The fundamentals</strong><br />
Search engines are basically databases. The information which is contained in the database has changed throughout the years. What the user needs to know about a Web page is :-<br />
1) What is it about?<br />
2) How is it rated, is it written by a crank or does it contain good and useful stuff?<br />
<a href="http://infolab.stanford.edu/~backrub/google.html ">http://infolab.stanford.edu/~backrub/google.html </a><br />
Describes the main techniques used in search engines.<br />
Google became the primary search engine on the basis of what might be termed a citation index. Scientists have used this principle almost from the year dot. At the bottom of an academic paper are references and these references are &#8220;citations&#8221;. The &#8220;Science Citation Index&#8221; is an index of papers which cite a given paper. Now a paper which is frequently cited is generally regarded as being a good paper. Google does exactly the same things with hyperlinks. There is also the number of times other people access a website.</p>
<p><a href="http://209.85.163.132/papers/sawzall-sciprog.pdf">http://209.85.163.132/papers/sawzall-sciprog.pdf</a><br />
The Web is of course very large and Google has to find a way of dividing up the tasks. This paper is the key to the way in which Google does this. The database is far too large to place on a single machine, and is therefore stored on a number of servers. Sawzall is quite ingenious. A query is passed round from server to server, but while the query is in transit other queries are being worked on. Hence although a query takes a few seconds to process on the network, the fact that other queries  can be processed at the same time means that a high throughput is maintained. One quite important fact is that it is possible to discuss aggregations. That is to say once websites are found with their keywords a further search based on programs written in C++ can be performed.</p>
<p><a href="http://infolab.stanford.edu/pub/papers/google.pdf">http://infolab.stanford.edu/pub/papers/google.pdf</a><br />
Describes what Google was doing in 2000</p>
<p>Google wants to know your surfing history. This will enable it to both target web pages and ads. Suppose I am a civil engineer and I enter &#8220;Bridge&#8221; as one of my search terms. A civil engineer is interested in &#8220;puente that is the sort of bridge that crosses a river. If I am a card player I will be interested in the game of Bridge. A website containing &#8220;4 hearts&#8221;, is about a card game.</p>
<p>Google also wants to target it advertisments. It wants to know what you think of a particular organization.<br />
<a href="http://ryanmcd.googlepages.com/sentimentACL07.pdf">http://ryanmcd.googlepages.com/sentimentACL07.pdf</a><br />
Does just that. It used a training set There is of course one other thing. Advertisers like some sort of feedback on how they and their product is perceived. This paper attempts to achieve this and manages to achieve scores approximating to 80%.</p>
<p>It is not my aim to make moral judgements about Google. Google in fact, unlike Microsoft, has not broken the law. Indeed the Google code is mostly open source. How it is all put together is highly proprietary, but there are references to source code in all the papers. If you are bundling inaccessible code with a inferior operating system (Windows as a sheer operating system is inferior to Linux.) a fine of x million Euros a day is appropriate. Google technology is immensely powerful and society will have to come to terms with it in some way.</p>
<p><strong>Google and Semantic Analysis</strong><br />
<a href="http://www2007.org/papers/paper342.pdf">http://www2007.org/papers/paper342.pdf</a><br />
This is a most remarkable paper. Let us dissect some of the terminology. It talks about &#8220;Vectors&#8221;. What are these &#8220;vectors&#8221;? They are all derived from Latent Semantic Analysis, or some other allied method. It talks about partially indexing the vectors (not storing the full vector). It takes queries and search results. It actually looks at what people have put into their query as keywords and the web pages they actually click on. An algorithm is developed for giving people exactly what they want. The paper makes great play on optimization for an inveted file search. Now an inverted file is a database file where the entries are indexed. Quite clearly if you are doing web based searches</p>
<p><a href="http://labs.google.com/papers/orkut-kdd2005.html ">http://labs.google.com/papers/orkut-kdd2005.html </a></p>
<p>This paper is 2007 so its results are not yet in &#8220;Google&#8221;. The methodology is amazingly powerful and could be applied in a variety of circumstances. Slightly chillingly the &#8220;Orkut&#8221; set which correlates friendship and personality and other similarities is used. The paper can effectively find you matches and build you up a friendship network. Equally it can judge you by the friends that you have!</p>
<p>Potentially you could take El Cid and its English translation and match words up. Rather you are not just matching words you are matching vectors. An inverted file then gives the correct Spanish translation for an English vector and vice versa. This program will take any set of vector pairs and do a match.</p>
<p><strong>Translation</strong><br />
At present translation with Google Translate is rather poor.<br />
<em>El barco attravesta una cerradura </em>- The boat goes through a lock<br />
<em>La estacion de resorte </em>- The season of spring.</p>
<p><a href="http://www.stefanriezler.com/PAPERS/NAACL06.pdf ">http://www.stefanriezler.com/PAPERS/NAACL06.pdf </a><br />
Google have in 2006 recruited Stefan Riezler. It is interesting in that it indicates a direction in which Google is moving. Here is his CV<br />
<a href="http://www.stefanriezler.com/CV07.pdf">http://www.stefanriezler.com/CV07.pdf</a><br />
It is probably a pretty good summary of the way in which Google intend to go. One thing should be pointed out straight away and that is that is that the Google NLP initiatives are based on strict parsing as their starting point. This contrasts with some versions of Latent Semantic Analysis where unparsed words are entered. Google looks at subjects, verbs, adjectives, adverbs, objects and possessives. Google is also interested in question and answer responses.</p>
<p><a href="http://www.cs.nyu.edu/~mohri/postscript/hbka.pdf">http://www.cs.nyu.edu/~mohri/postscript/hbka.pdf</a><br />
This paper is a review article about a very much related area of speech recognition. I think I should say straight away that the recognition of individual phonemes by computer is as good if not better than that performed manually. The reason why human speech recognition is better than that of a machine is that humans recognize words in context. This in fact makes speech very similar to translation. I can illustrate this with words that have different meanings and spellings but the same phoneme structure. Whether (<em>si</em>), weather (<em>tiempo</em>)  hear (<em>oir</em>), here (<em>aqui</em>). One thing that is a little bit disappointing is that the speech and NLP groups  in Google appear to be working independently.</p>
<p>Speech is in fact a far harder problem than translation or the discernment of meaning from text. This is because in translating from text you have fewer choices. The methos used is Markov chains and the association of neighbouring words including grammar. Interestingly in neither Riezler&#8217;s work or this are words chosen on the basis of long range meaning. Let us say we have a medical paper and we could bias the search to medical terms. They do not seem to do this.</p>
<p>To produce the right words in speech you need an iterative annealing process. This means you may wish to change the phoneme, or word, assignment once other words have been found.</p>
<p><a href="http://www.stefanriezler.com/PAPERS/ACL07.pdf">http://www.stefanriezler.com/PAPERS/ACL07.pdf</a><br />
Suppose I am not looking for a website. Suppose I want to know a fact. &#8220;What is the velocity of light?&#8221;, &#8220;What is somebody&#8217;s address?&#8221;, &#8220;What is the turnover of company X?&#8221;. To answer a question the question needs to be parsed so that its meaning can be ascertained. We are here quite close to the Turing test.<br />
<a href="http://www.cs.cmu.edu/~acarlson/semisupervised/million-fact-aaai06.pdf ">http://www.cs.cmu.edu/~acarlson/semisupervised/million-fact-aaai06.pdf </a><br />
<a href="http://www.cs.bell-labs.com/cm/cs/who/pfps/temp/web/www2007.org/papers/paper560.pdf ">http://www.cs.bell-labs.com/cm/cs/who/pfps/temp/web/www2007.org/papers/paper560.pdf </a></p>
<p>This is the first stage of Google&#8217;s program. A database of, initially, a million facts will be gathered. These facts are going into a database which will be used to answer questions. This will of course be extended as time goes on.</p>
<p><strong>Head to Head with Microsoft</strong><br />
Google has a spreadsheet and a word processor. It also features desktop publishing.<br />
<a href="https://www.google.com/accounts/ServiceLogin?service=writely&amp;passive=true&amp;continue=http%3A%2F%2Fdocs.google.com%2F%3Fhl%3Den_GB&amp;hl=en_GB&amp;ltmpl=homepage&amp;nui=1&amp;utm_source=en_GB-more&amp;utm_medium=more&amp;utm_campaign=en_GB">https://www.google.com/accounts/ServiceLogin?service=writely&amp;passive=true&amp;continue=http%3A%2F%2Fdocs.google.com%2F%3Fhl%3Den_GB&amp;hl=en_GB&amp;ltmpl=homepage&amp;nui=1&amp;utm_source=en_GB-more&amp;utm_medium=more&amp;utm_campaign=en_GB</a><br />
There are advantages and disadvantages in using the Web for basic word processing and spreadsheets. The advantages are that the software is :-</p>
<p>1) Up to date.<br />
2) Will run of both Linux and windows systems.<br />
3) Is free.<br />
4) There are facilities for work sharing.</p>
<p>http://labs.google.com/papers/gfs.html</p>
<p>5) Your work is backed up automatically.</p>
<p>The disadvantages are that you need to be connected to the Web to access your work. There are question marks over security, although to be fair Google is investing a considerable effort in this field. </p>
<p><a href="http://labs.google.com/papers.html">http://labs.google.com/papers.html</a><br />
This gives a list of Google papers. Note those on security. I have not mentioned them individually since my main thrust is AI.<br />
I feel that we should look at spellcheckers and how word processing and AI can be integrated. Often when we spell words wrong the spelling is valid but means something different. People will often spell words that sound the same wrongly. This puts spellcheckers in the same position as translators, and on a Web spell-check the latest translator can be used. If I use a translator as a spellchecker I am one stage up on anything Microsoft has produced. If you are writing in Spanish &#8220;<em>si</em>&#8221; and &#8220;<em>tiempo</em>&#8221; are never confused. In English large number of people confuse &#8220;whether&#8221; and &#8220;weather&#8221;. Present day spellcheckers pass both.</p>
<p>There is one other point. If I want to write something learned, I want references. If I write on web software Google can suggest them to me. If I have Microsoft software on my own computer, it cannot do this.</p>
<p><strong>Conclusion</strong><br />
I started off this investigation rather skeptical. I came away from Google translate distinctly unimpressed. &#8220;La estacion de resorte&#8221; &#8211; I did not know stations were elastic! I came away deeply impressed with the work which Google are doing and its scope. My criticism that the research on Natural Language should involve more interchange of information is perhaps rather carping, considering the difficulties involved in running a program on this scale.<br />
On the question of personal information I can see where Google is coming from. Lets put it this way. If you meet a friend in the street you will have remembered some of the &#8220;personal&#8221; information that they have told you if you were to start a conversation. We can thus show that any Turing machine must store personal information, you need personal information stored if you are ever going to &#8220;talk to Google&#8221;. It is also vital for proper retrieval of information. The information you get must be relevant to you.<br />
<a href="http://michaelaltendorf.wordpress.com/2007/06/13/top-100-alternative-search-engines-from-readwrite-web/">http://michaelaltendorf.wordpress.com/2007/06/13/top-100-alternative-search-engines-from-readwrite-web/</a><br />
The whole point of search engine technology is to get relevant references and facts. This reference misses this point completely. If you need a 3D display your basic engine is lousy.</p>
<p>Google is now entering the world of facts rather than just websites. This could have some very interesting consequences in the future.<br />
There is one fact that society in the future will have to come to terms with. To become president of the United States you need television exposure. Television, telephones and the Internet are now becoming one. Who will choose the programs you watch? Why Google of course. This is a tremendous responsibility.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.paperoftheweek.com/2007/06/17/googles-initiatives-in-artificial-intelligence/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Guest Contributor wanted for next 3 weeks</title>
		<link>http://www.paperoftheweek.com/2007/04/15/guest-contributor-wanted-for-next-3-weeks/</link>
		<comments>http://www.paperoftheweek.com/2007/04/15/guest-contributor-wanted-for-next-3-weeks/#comments</comments>
		<pubDate>Sun, 15 Apr 2007 19:40:05 +0000</pubDate>
		<dc:creator>grant.ingersoll</dc:creator>
				<category><![CDATA[Algorithms]]></category>
		<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Computer Science]]></category>
		<category><![CDATA[Information Retrieval]]></category>
		<category><![CDATA[Machine Learning]]></category>
		<category><![CDATA[Natural Language Processing (NLP)]]></category>
		<category><![CDATA[Question Answering]]></category>
		<category><![CDATA[Text Categorization]]></category>
		<category><![CDATA[disambiguation]]></category>

		<guid isPermaLink="false">http://www.paperoftheweek.com/2007/04/15/guest-contributor-wanted-for-next-3-weeks/</guid>
		<description><![CDATA[If you have an interest in writing on artificial intelligence, clustering, information retrieval or computer science in general and are interested in reviewing one or more articles over the coming three weeks on this forum, please contact me by leaving a comment on this post.  All topics will be subject to my review for appropriateness, [...]]]></description>
			<content:encoded><![CDATA[<p>If you have an interest in writing on artificial intelligence, clustering, information retrieval or computer science in general and are interested in reviewing one or more articles over the coming three weeks on this forum, please contact me by leaving a comment on this post.  All topics will be subject to my review for appropriateness, but I am open to most any article or publication in the Computer Science field.</p>
<p>Otherwise, I will be taking a brief hiatus from reviewing papers until I return from <a href="http://www.eu.apachecon.com">ApacheCon Europe</a> in the early part of May where I am giving a talk and a training on Lucene.  I have several key deadlines over the next two weeks that must take higher priority, including the publication of a couple of articles that I have been working on.  I will post details on the publication on my <a href="http://lucene.grantingersoll.com/">Lucene blog</a> when they are available.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.paperoftheweek.com/2007/04/15/guest-contributor-wanted-for-next-3-weeks/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>POTW 3/19/07: Discussion of &#8220;An Analysis of the AskMSR Question-Answering System&#8221;</title>
		<link>http://www.paperoftheweek.com/2007/03/24/potw-31907-discussion-of-an-analysis-of-the-askmsr-question-answering-system/</link>
		<comments>http://www.paperoftheweek.com/2007/03/24/potw-31907-discussion-of-an-analysis-of-the-askmsr-question-answering-system/#comments</comments>
		<pubDate>Sat, 24 Mar 2007 22:53:34 +0000</pubDate>
		<dc:creator>grant.ingersoll</dc:creator>
				<category><![CDATA[Algorithms]]></category>
		<category><![CDATA[Computer Science]]></category>
		<category><![CDATA[Information Retrieval]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[Natural Language Processing (NLP)]]></category>
		<category><![CDATA[Question Answering]]></category>

		<guid isPermaLink="false">http://www.paperoftheweek.com/2007/03/24/potw-31907-discussion-of-an-analysis-of-the-askmsr-question-answering-system/</guid>
		<description><![CDATA[Better late than never is what they say&#8230; I&#8217;ve been absolutely slammed this week trying to get out a proposal and finish up my slides for ApacheCon Europe. So, here goes on the POTW for this week. If you recall, we are reading &#8220;An Analysis of the AskMSR Question-Answering System&#8221; by Brill, Dumais and Banko. [...]]]></description>
			<content:encoded><![CDATA[<p>Better late than never is what they say&#8230;  I&#8217;ve been absolutely slammed this week trying to get out a proposal and finish up my slides for <a href="http://www.eu.apachecon.com">ApacheCon Europe</a>.  So, here goes on the POTW for this week.  If you recall, we are reading &#8220;<a href="http://research.microsoft.com/~sdumais/EMNLP_Final.pdf">An Analysis of the AskMSR Question-Answering System</a>&#8221; by Brill, Dumais and Banko.</p>
<p>Microsoft&#8217;s AskMSR system, as detailed in this paper, takes  a different tact  for trying to solve the QA problem.  Unlike most systems that use sophisticated NLP techniques, AskMSR makes the bet that somewhere on the web the exact answer is written in just the right way to answer the question.  Their approach is touted to be much simpler and much more efficient.  Also, they suggest this approach is complementary to the other approaches, even though in this paper they are trying to see just how far they can get.</p>
<p>AskMSR relies on the Internet as a massively redundant data source.  Section 2 of the paper lays out the System Architecture, which can be br0ken down into 4 sections:</p>
<ol>
<li>Query Reformulation
<ol>
<li>Takes the input query and rewrites into a number of different possible answers.  As a last resort, they also produce some less precise versions made up of non-stopwords.  No POS tagger or parser is used.</li>
</ol>
</li>
<li>n-gram mining
<ol>
<li>Given the rewrites, they submit them to a search engine, get the page summaries and use n-grams to calculate candidate answers and score them.</li>
</ol>
</li>
<li>Filtering
<ol>
<li>Given the n-grams, AskMSR then filters and reweights the answers based on answer-type, as specified by a few in-house, handwritten filters.  This is most likely one area where more advanced NLP techniques could be used to determine.</li>
</ol>
</li>
<li>n-gram tiling
<ol>
<li>The tiling algorithm merges similar answers and creates longer answers from overlapping answers.</li>
</ol>
</li>
</ol>
<p>Section 3 then details the experiments that were run, how they performed and how the different components contributed.  Ironically, they use Google for their search engine, but my guess is there current system does not.  Section 3.2 lays out how important each of the individual components are to the process.    While query rewrites weighting are pretty important (11.2% drop in performance when all are weighted equally), n-gram filtering is the single most important part, contributing nearly 18% over the baseline.  Tiling rounds things off by contributing approximately 14% to the process.</p>
<p>Section 4 has a discussion of how the various components contribute to errors.  Interestingly, many of the errors are due to their projection of answers onto the TREC-9 corpus and wouldn&#8217;t really be an issue in a live system.  Another interesting error source is the inability to query the search engine for numbers without including the number in the query:</p>
<blockquote><p>&#8220;For example, a good rewrite for the query <em>How many islands does Fiji have</em> wold be &lt;&lt;<em>Fiji has &lt;NUM&gt; islands &gt;&gt;</em> but we are unable to give this type of query to the search engine&#8221;</p></blockquote>
<p>It seems to me, though, that this could be addressed by some type of bounded approach.   For instance, why not just iterate through trying some numbers, within reason.   I wonder, too, how good their potential answers would be if they just put in any number.   I tried the Google query: &#8220;Fiji number islands&#8221; and the top three results have the answer and the third one, I think would be handled by the n-gram approach.</p>
<p>Section 5 is a good discussion on knowing when the AskMSR system does not know.  They use a decision tree that uses a number of factors, like query length, number of stopwords, etc. The thought process behind this is that why bother trying to answer questions that you know have a very low likelihood of being answered in the first place.  For example, they know that they don&#8217;t do very well on how questions, so they could simply say they don&#8217;t know the answer or just return keyword search results for the question (in a live system).  Of course, it all depends on how tolerant your users are of incorrect answers.</p>
<p>Well, that&#8217;s it for this week.  Haven&#8217;t decided on next week yet.  May do another QA paper, may look into some clustering algorithms.  Anyone have anything in particular they want to look at?</p>
]]></content:encoded>
			<wfw:commentRss>http://www.paperoftheweek.com/2007/03/24/potw-31907-discussion-of-an-analysis-of-the-askmsr-question-answering-system/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>POTW 3/19/07: &#8220;An Analysis of the AskMSR Question-Answering System&#8221; by Brill, et. al.</title>
		<link>http://www.paperoftheweek.com/2007/03/18/potw-31907-an-analysis-of-the-askmsr-question-answering-system-by-brill-et-al/</link>
		<comments>http://www.paperoftheweek.com/2007/03/18/potw-31907-an-analysis-of-the-askmsr-question-answering-system-by-brill-et-al/#comments</comments>
		<pubDate>Mon, 19 Mar 2007 01:10:09 +0000</pubDate>
		<dc:creator>grant.ingersoll</dc:creator>
				<category><![CDATA[Algorithms]]></category>
		<category><![CDATA[Computer Science]]></category>
		<category><![CDATA[Information Retrieval]]></category>
		<category><![CDATA[Natural Language Processing (NLP)]]></category>
		<category><![CDATA[Question Answering]]></category>

		<guid isPermaLink="false">http://www.paperoftheweek.com/2007/03/18/potw-31907-an-analysis-of-the-askmsr-question-answering-system-by-brill-et-al/</guid>
		<description><![CDATA[Paper of the Week for March 19, 2007 is &#8220;An Analysis of the AskMSR Question-Answering System&#8221; by Bill, Dumais and Banko. Let&#8217;s have a look at how Microsoft does QA, shall we?]]></description>
			<content:encoded><![CDATA[<p>Paper of the Week for March 19, 2007 is &#8220;<a href="http://research.microsoft.com/~sdumais/EMNLP_Final.pdf">An Analysis of the AskMSR Question-Answering System</a>&#8221; by Bill, Dumais and Banko.   Let&#8217;s have a look at how Microsoft does QA, shall we?</p>
]]></content:encoded>
			<wfw:commentRss>http://www.paperoftheweek.com/2007/03/18/potw-31907-an-analysis-of-the-askmsr-question-answering-system-by-brill-et-al/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>POTW 3/11/07: Discussion of &#8220;COGEX: A Logic Prover for Question Answering&#8221;</title>
		<link>http://www.paperoftheweek.com/2007/03/16/potw-31107-discussion-of-cogex-a-logic-prover-for-question-answering/</link>
		<comments>http://www.paperoftheweek.com/2007/03/16/potw-31107-discussion-of-cogex-a-logic-prover-for-question-answering/#comments</comments>
		<pubDate>Fri, 16 Mar 2007 12:59:10 +0000</pubDate>
		<dc:creator>grant.ingersoll</dc:creator>
				<category><![CDATA[Algorithms]]></category>
		<category><![CDATA[Computer Science]]></category>
		<category><![CDATA[Information Retrieval]]></category>
		<category><![CDATA[Natural Language Processing (NLP)]]></category>
		<category><![CDATA[Question Answering]]></category>

		<guid isPermaLink="false">http://www.paperoftheweek.com/2007/03/16/potw-31107-discussion-of-cogex-a-logic-prover-for-question-answering/</guid>
		<description><![CDATA[This week we are digging deeper into the published work of Language Computer Corporation&#8216;s QA (they have an online demo on their home page) system by looking at COGEX: A Logic Prover for Question Answering by Moldovan, Clark, Harabagiu and Maiorano.  If you recall from a few weeks ago, the LCC system was the top [...]]]></description>
			<content:encoded><![CDATA[<p>This week we are digging deeper into the published work of <a href="http://www.languagecomputer.com/">Language Computer Corporation</a>&#8216;s QA (they have an online demo on their home page) system by looking at <a href="http://citeseer.ist.psu.edu/rd/70536349%2C656408%2C1%2C0.25%2CDownload/http://citeseer.ist.psu.edu/cache/papers/cs/32721/http:zSzzSzacl.ldc.upenn.eduzSzNzSzN03zSzN03-1022.pdf/cogex-a-logic-prover.pdf">COGEX: A Logic Prover for Question Answering</a> by Moldovan, Clark, Harabagiu and Maiorano.  If you recall from a <a href="http://www.paperoftheweek.com/2007/03/01/potw-22607-overview-of-the-trec-2003-question-answering-track-by-voorhees-2/">few weeks ago</a>, the LCC system was the top performer at TREC 2003 (see  last week&#8217;s <a href="http://www.paperoftheweek.com/2007/03/04/potw-3407-answer-mining-by-combining-extraction-techniques-with-abductive-reasoning-by-harabagiu-et-al/">POTW</a>  for a discussion of the overall system.)</p>
<p>In this paper, LCC demonstrates that using a theorem proving strategy to validate answers found through traditional QA means can improve performance by over 30% on TREC style questions.  To support this claim, LCC takes us through, at a high level, the steps involved in creating a theorem prover for use in QA.</p>
<p>Section 1 introduces us to the problem at hand and why they think COGEX is useful for QA and what types of challenges were overcome to implement it.  To summarize, COGEX creates logical representations of the questions, candidates and other world and language knowledge to re-rank candidate answers and remove incorrect answers.  It should be noted that it doesn&#8217;t identify the candidates to begin with, just &#8220;proves&#8221; they are correct.  The main technical challenges in the approach occur in two main areas: creating the logically representation of the necessary world knowledge and other inputs and the high failure rates and long processing time required to apply the prover (I wonder if LCC has overcome these for their online demo, or if they don&#8217;t use the theorem prover in the demo.)</p>
<p>Section 2 outlines how the theorem prover is integrated into the QA system.  They have a module called the Axiom Builder which takes in various free text inputs and WordNet glosses and builds logical representations of these.  On a side note, I wonder about the storage and retrieval algorithms used for these.  I would imagine one needs a way of quickly looking up the appropriate pieces, especially when it comes to the Wordnet glosses.  All of this input is fed into the justification module, which tries to prove out the theorem, relying on some relaxation techniques if the proof fails.</p>
<p>Getting into the guts of the program, sections 3 and 4 discuss in detail how to create logical representations of the text and the world knowledge incorporated into the system.  For the text, they use what they call a &#8220;logic form&#8221; which is a middle ground between a syntactic parse and a deep semantic representation.  In other words, they capture the various pieces of the structure of the text that they think are important, in this case: &#8220;(1) syntactic subjects, (2) syntactic objects, (3) prepositional attachments, (4) complex nominals, and (5) adjectival/adverbial adjuncts&#8221; (page 2).  To build the logic form, they map the words to the predicates, using the base form plus part of speech info.  Nouns are supplemented with a tag that allows it to be referred to from other predicates.  From here, they use grammar rules to construct the actual representation.  Since there are so many grammar rules, they observe that the top ten most frequently used rules cover more than 90% of the cases that they need in WordNet.  If I understand this correctly, they are saying common grammar rules such as Subject Verb Object, etc. are sufficient for most of their cases.</p>
<p>The next part  of the paper details how to use WordNet glosses to build a bank of world knowledge for use in the theorem prover by developing lexical chains linking synsets across hierarchies.  This is useful for increasing passage retrieval and extractions, as well.  The chains codify topically related concepts and are used in the proving algorithm.  Several examples are given at the end of section 4 for those interested in seeing more details.</p>
<p>Before getting to how the theorem prover works in section 6, section 5 discusses how NLP Axioms are built for the various language nuggets they are interested in, such as complex nominals, possessives, etc.  One thing to note is these axioms have strengths associated with them that play a role in scoring the answers in the prover.  Weaker axioms result in weaker confidence in a given proof.</p>
<p>Sections 6 and 7 discuss how the axioms are used and show an in-depth example of the process in action.  Reading the syntax is a bit mind numbing, but essentially it relies on a couple key pieces: hyperresolution, paramodulation and proof by contradiction.   See the subsection of section 6 on Inference Rules for definitions of hyperresolution and paramodulation.  In my naive understanding, they are ways of reducing and substituting to make the proof more manageable.  Proof by contradiction is simply a way of proving something by assuming it is incorrect and then logically deducing that if it were incorrect than some known fact would be wrong.  In the example of section 7, they assume there is NOT a company/organization that developed the Mosaic Internet browser which is contradicted by the fact that their passage states that NCSA developed Mosaic.  One interesting thing to note here, is this type of proof would not work well for subjective things or on passages that are lying or incorrect.</p>
<p>Section 8 discusses results.  Namely, they claim a 30% improvement in correct answers for TREC 2002.  Quite significant results with the trade off lying in the time and effort it takes to both codify the prover and actually run it on real systems.  This is often the trade off in NLP systems in the current state of things.  It seems you can have either fast or good, but rarely can you have both when it comes to deep, complex problems such as QA.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.paperoftheweek.com/2007/03/16/potw-31107-discussion-of-cogex-a-logic-prover-for-question-answering/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>POTW: 3/11/07: &#8220;COGEX: A Logic Prover for Question Answering&#8221; by Moldovan, et. al.</title>
		<link>http://www.paperoftheweek.com/2007/03/12/potw-31107-cogex-a-logic-prover-for-question-answering-by-moldovan-et-al/</link>
		<comments>http://www.paperoftheweek.com/2007/03/12/potw-31107-cogex-a-logic-prover-for-question-answering-by-moldovan-et-al/#comments</comments>
		<pubDate>Mon, 12 Mar 2007 12:02:38 +0000</pubDate>
		<dc:creator>grant.ingersoll</dc:creator>
				<category><![CDATA[Algorithms]]></category>
		<category><![CDATA[Computer Science]]></category>
		<category><![CDATA[Information Retrieval]]></category>
		<category><![CDATA[Natural Language Processing (NLP)]]></category>
		<category><![CDATA[Question Answering]]></category>

		<guid isPermaLink="false">http://www.paperoftheweek.com/2007/03/12/potw-31107-cogex-a-logic-prover-for-question-answering-by-moldovan-et-al/</guid>
		<description><![CDATA[Following on from last weeks look at Language Computer Corporations TREC 2003 entry, we are going to dig deeper into the theorem prover part of the system and look at &#8220;COGEX: A Logic Prover for Question Answering&#8221; by Moldovan, et. al.]]></description>
			<content:encoded><![CDATA[<p>Following on from last weeks look at Language Computer Corporations TREC 2003 entry, we are going to dig deeper into the theorem prover part of the system and look at &#8220;<a href="http://citeseer.ist.psu.edu/rd/70536349%2C656408%2C1%2C0.25%2CDownload/http://citeseer.ist.psu.edu/cache/papers/cs/32721/http:zSzzSzacl.ldc.upenn.eduzSzNzSzN03zSzN03-1022.pdf/cogex-a-logic-prover.pdf">COGEX: A Logic Prover for Question Answering</a>&#8221; by Moldovan, et. al.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.paperoftheweek.com/2007/03/12/potw-31107-cogex-a-logic-prover-for-question-answering-by-moldovan-et-al/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>POTW 3/4/07: Answer Mining by Combining Extraction Techniques with Abductive Reasoning by Harabagiu, et. al.</title>
		<link>http://www.paperoftheweek.com/2007/03/09/potw-3407-answer-mining-by-combining-extraction-techniques-with-abductive-reasoning-by-harabagiu-et-al-2/</link>
		<comments>http://www.paperoftheweek.com/2007/03/09/potw-3407-answer-mining-by-combining-extraction-techniques-with-abductive-reasoning-by-harabagiu-et-al-2/#comments</comments>
		<pubDate>Fri, 09 Mar 2007 21:37:56 +0000</pubDate>
		<dc:creator>grant.ingersoll</dc:creator>
				<category><![CDATA[Algorithms]]></category>
		<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Computer Science]]></category>
		<category><![CDATA[Information Retrieval]]></category>
		<category><![CDATA[Machine Learning]]></category>
		<category><![CDATA[Natural Language Processing (NLP)]]></category>
		<category><![CDATA[Question Answering]]></category>

		<guid isPermaLink="false">http://www.paperoftheweek.com/2007/03/09/potw-3407-answer-mining-by-combining-extraction-techniques-with-abductive-reasoning-by-harabagiu-et-al-2/</guid>
		<description><![CDATA[This weeks paper, &#8220;Answer Mining by Combining Extraction Techniques with Abductive Reasoning&#8220;, lays out, at a high level, the capabilities of the highest performing QA system at TREC 2003, namely Language Computer Corporation&#8217;s QA system.  The first section or two lay out the groundwork for the competition, much as was already done in the Voorhees [...]]]></description>
			<content:encoded><![CDATA[<p>This weeks paper, &#8220;<a href="http://trec.nist.gov/pubs/trec12/papers/lcc.qa.pdf">Answer Mining by Combining Extraction Techniques with Abductive Reasoning</a>&#8220;, lays out, at a high level, the capabilities of the highest performing QA system at TREC 2003, namely <a href="http://www.languagecomputer.com/">Language Computer Corporation&#8217;s</a> QA system.  The first section or two lay out the groundwork for the competition, much as was already done in the Voorhees paper from last week.  The real meat of the paper starts in the section titled &#8220;The architecture of the QA system&#8221;.</p>
<p>The system is divvied up into several components, as displayed in Figure 1 of the document.  They are the question processing unit, document processing, factoid answer processing, list answer processing and definition answer processing.  All documents are processed in the same way and passages are retrieved based on the keywords in question.  Depending on the type of question, some passages are removed if they do not have the right answer type.  Passages having a higher number of expected answer types are favored for the list based approach.</p>
<p>For factoid questions, LCC used their CICERO LITE system to provide extractions and answers were calculated based on the extractions and/or the expected answer types.    The extraction process had to identify a variety of semantic classes, such as quantity, date, people, city, etc.  The paper then discusses the types of questions that were answered by these approaches, as well as some special case scenarios related to manner of death (kind of morbid, but death is often a fascination of this kind of research, in my experience).  From here, there is a discussion of the role of theorem proving in the algorithm (see page 5), but the details are left to another paper (guess what will be the paper next week?)  I must admit, I don&#8217;t fully understand page 5 and 6 just yet, mostly, I think, because I&#8217;m not familiar with the syntax they are using, so maybe reading the next paper will make it more clear.</p>
<p>Page 6 continues with discussion of finding answers for definition questions, which relies on a pattern matching approach to find answers based on 38 internally developed patterns, some of which are in Table 5 in the paper.</p>
<p>Page 7 finishes of with a discussion of their list based approach, which uses a threshold cutoff approach that determines the similarity between the first and last answers in the list, and all those in between, cutting off the answers when they reach a threshold.</p>
<p>The rest of the paper is performance evaluation and references.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.paperoftheweek.com/2007/03/09/potw-3407-answer-mining-by-combining-extraction-techniques-with-abductive-reasoning-by-harabagiu-et-al-2/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>POTW 3/4/07: Answer Mining by Combining Extraction Techniques with Abductive Reasoning by Harabagiu, et. al.</title>
		<link>http://www.paperoftheweek.com/2007/03/04/potw-3407-answer-mining-by-combining-extraction-techniques-with-abductive-reasoning-by-harabagiu-et-al/</link>
		<comments>http://www.paperoftheweek.com/2007/03/04/potw-3407-answer-mining-by-combining-extraction-techniques-with-abductive-reasoning-by-harabagiu-et-al/#comments</comments>
		<pubDate>Mon, 05 Mar 2007 01:36:48 +0000</pubDate>
		<dc:creator>grant.ingersoll</dc:creator>
				<category><![CDATA[Algorithms]]></category>
		<category><![CDATA[Computer Science]]></category>
		<category><![CDATA[Natural Language Processing (NLP)]]></category>
		<category><![CDATA[Question Answering]]></category>

		<guid isPermaLink="false">http://www.paperoftheweek.com/2007/03/04/potw-3407-answer-mining-by-combining-extraction-techniques-with-abductive-reasoning-by-harabagiu-et-al/</guid>
		<description><![CDATA[Building on our discussion of TREC 2003 QA track from last week, we will examine the top performing system from Language Computer Corporation via their paper: Answer Mining by Combining Extraction Techniques with Abductive Reasoning available from the TREC 2003 proceedings located here.]]></description>
			<content:encoded><![CDATA[<p>Building on our discussion of TREC 2003 QA track from last week, we will examine the top performing system from Language Computer Corporation via their paper: <a href="http://trec.nist.gov/pubs/trec12/papers/lcc.qa.pdf">Answer Mining by Combining Extraction Techniques with Abductive Reasoning</a> available from the TREC 2003 proceedings located <a href="http://trec.nist.gov/pubs/trec12/t12_proceedings.html">here</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.paperoftheweek.com/2007/03/04/potw-3407-answer-mining-by-combining-extraction-techniques-with-abductive-reasoning-by-harabagiu-et-al/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>POTW 2/26/07: Overview of the TREC 2003 Question Answering Track by Voorhees</title>
		<link>http://www.paperoftheweek.com/2007/03/01/potw-22607-overview-of-the-trec-2003-question-answering-track-by-voorhees-2/</link>
		<comments>http://www.paperoftheweek.com/2007/03/01/potw-22607-overview-of-the-trec-2003-question-answering-track-by-voorhees-2/#comments</comments>
		<pubDate>Fri, 02 Mar 2007 02:16:20 +0000</pubDate>
		<dc:creator>grant.ingersoll</dc:creator>
				<category><![CDATA[Algorithms]]></category>
		<category><![CDATA[Computer Science]]></category>
		<category><![CDATA[Information Retrieval]]></category>
		<category><![CDATA[Natural Language Processing (NLP)]]></category>
		<category><![CDATA[Question Answering]]></category>

		<guid isPermaLink="false">http://www.paperoftheweek.com/2007/03/01/potw-22607-overview-of-the-trec-2003-question-answering-track-by-voorhees-2/</guid>
		<description><![CDATA[Overview of the TREC 2003 Question Answering Track by Voorhees is a nice introduction to the QA task, as defined by TREC. The first two sections layout the two main tasks: The Passages Task Main Task &#8211; Divided into three sub tasks Factoids Lists Definitions The passages task is used to test whether a QA [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.inf.ed.ac.uk/teaching/courses/tts/papers/QA.OVERVIEW.pdf">Overview of the TREC 2003 Question Answering Track</a> by Voorhees is a nice introduction to the QA task, as defined by TREC.  The first two sections layout the two main tasks:</p>
<ol>
<li>The Passages Task</li>
<li>Main Task &#8211; Divided into three sub tasks
<ol>
<li>Factoids</li>
<li>Lists</li>
<li>Definitions</li>
</ol>
</li>
</ol>
<p>The passages task is used to test whether a QA system can find factoids in fairly short spans of text (250 characters).  As compared to the factoids sub-task under the main task, this approach allows answers to be a little more &#8220;loose&#8221;.  Answers are judged correct if they contained the right answer, the document that they came from supports the answer and that the answer was &#8220;responsive&#8221;.  Responsive, in my understanding, means answers have the correct units or they refer to the actual item of interest and not a replica or imitation.  The evaluation judged the accuracy of the answers as the fraction of correct answers as determined by human judges.</p>
<p>The main task was broken up into three sub tasks.  Participants had to do all three tasks.  The factoid task is pretty much the same as the passage task, but more strict in that exact answers had to be returned, not just passages.  The list task requires systems to return one or more list of items that make up an answer.  For example, if the question was &#8220;what are the 50 states that make up the United States?&#8221; a correct answer would have to enumerate all 50 states.   Finally, the definition task required the system to define things, such as &#8220;Who is George Bush?&#8221; or &#8220;what is information retrieval?&#8221;  Much of section 2.3 details how the definition task was evaluated.  If anything, it makes you think about how you go about, as a person, finding a correct answer to a question.  Many people, I think, take for granted that google or something gives you a correct answer to your questions.</p>
<p>Section 4 is an extensive discussion of how the evaluation was completed.</p>
<p>In my opinion, having worked on a QA system, I would say there are a couple of other areas that are of interest in QA.  First, though, it should be noted that QA is a very hard problem, not only do most systems rely on an IR system for retrieval, which is less than perfect, but then you need to find the exact answer in the given passage.  At any rate, I also think there are several other areas that warrant research and are more interesting in some ways.  First, are questions of that require longer answers that may span multiple passages or that answer questions targeted towards a more demanding audience.  For instance, how could you answer &#8220;why, in scientific terms, is the sky blue?&#8221; in 250 characters or less?  Second, of interest are questions that require essay style answers or answers that fit into positive and negative sections, such as &#8220;what are the pros and cons of tariffs on China?&#8221;  Granted, these are much harder, but I think they are much more realistic in some ways.  To me, definition questions are fairly quickly answered by searches on Google or Wikipedia.  To some extent this is also true for list questions as well.</p>
<p>Now that we have a nice intro to the QA task, we will start to look into how people implement QA systems.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.paperoftheweek.com/2007/03/01/potw-22607-overview-of-the-trec-2003-question-answering-track-by-voorhees-2/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>POTW 2/26/07: Overview of the TREC 2003 Question Answering Track by Voorhees</title>
		<link>http://www.paperoftheweek.com/2007/02/26/potw-22607-overview-of-the-trec-2003-question-answering-track-by-voorhees/</link>
		<comments>http://www.paperoftheweek.com/2007/02/26/potw-22607-overview-of-the-trec-2003-question-answering-track-by-voorhees/#comments</comments>
		<pubDate>Mon, 26 Feb 2007 14:13:22 +0000</pubDate>
		<dc:creator>grant.ingersoll</dc:creator>
				<category><![CDATA[Algorithms]]></category>
		<category><![CDATA[Computer Science]]></category>
		<category><![CDATA[Question Answering]]></category>

		<guid isPermaLink="false">http://www.paperoftheweek.com/2007/02/26/potw-22607-overview-of-the-trec-2003-question-answering-track-by-voorhees/</guid>
		<description><![CDATA[This week&#8217;s paper can be found at http://www.inf.ed.ac.uk/teaching/courses/tts/papers/QA.OVERVIEW.pdf. This paper should provide us an introduction to the QA problem and provide some background on evaluation.  From here, we will start looking into the different approaches taken for question answering. natural language processing, QA, question answering Technorati Tags: natural language processing, QA, question answering]]></description>
			<content:encoded><![CDATA[<p>This week&#8217;s paper can be found at <a href="http://www.inf.ed.ac.uk/teaching/courses/tts/papers/QA.OVERVIEW.pdf">http://www.inf.ed.ac.uk/teaching/courses/tts/papers/QA.OVERVIEW.pdf</a>.</p>
<p>This paper should provide us an introduction to the QA problem and provide some background on evaluation.  From here, we will start looking into the different approaches taken for question answering.</p>
<p>natural language processing, QA, question answering</p>
<p>Technorati Tags: <a href="http://technorati.com/tag/natural+language+processing" rel="tag">natural language processing</a>, <a href="http://technorati.com/tag/QA" rel="tag">QA</a>, <a href="http://technorati.com/tag/question+answering" rel="tag">question answering</a></p>]]></content:encoded>
			<wfw:commentRss>http://www.paperoftheweek.com/2007/02/26/potw-22607-overview-of-the-trec-2003-question-answering-track-by-voorhees/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
