<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments on: The Future of Search</title>
	<atom:link href="http://aabs.wordpress.com/2007/09/04/the-future-of-search/feed/" rel="self" type="application/rss+xml" />
	<link>http://aabs.wordpress.com/2007/09/04/the-future-of-search/</link>
	<description>Closed weekends and holidays.</description>
	<pubDate>Sat, 05 Jul 2008 04:33:50 +0000</pubDate>
	<generator>http://wordpress.org/?v=MU</generator>
		<item>
		<title>By: Andrew Matthews</title>
		<link>http://aabs.wordpress.com/2007/09/04/the-future-of-search/#comment-14567</link>
		<dc:creator>Andrew Matthews</dc:creator>
		<pubDate>Thu, 06 Sep 2007 00:17:43 +0000</pubDate>
		<guid isPermaLink="false">http://aabs.wordpress.com/2007/09/04/the-future-of-search/#comment-14567</guid>
		<description>&lt;i&gt;I just received this comment via private email from an architect on the search team within Microsoft, who has asked to remain anonymous, but who has some very interesting ideas:&lt;/i&gt;

It will take some time to digest this article. I will comment on one point right now, however:

&lt;blockquote&gt;Search engines assume that if many others link to a page then it must be valuable and so rank it higher.&lt;/blockquote&gt;

This is way off base. Ranking is an extremely complex process, taking into account a wide range of variables, including inbound links, meta data, heuristics, the quality and nature of inbound links (just speculating, a link from Wikipedia might rank very highly compared to other links), and so on. There are also algorithms to detect bot-generated content. Query logs (what people search for and what they click (and whether they stay there or quickly come back)) are also bountiful sources of ranking algorithms. And these are just the obvious things that the big engines figured out years ago.

The most important challenge -- and competitive playing field -- is capacity, and not just to hold the web, but to compute on it. We hold tens of billions of documents, so even URI-only graph algorithms have to be distributed. That's why Google and Microsoft (and all the other transnats) are buying land near power stations as fast as they can. The computational platform itself, is also a key strategic piece. Whoever can more elegantly express and efficiently execute computations over petabytes of data has a significant competitive edge.</description>
		<content:encoded><![CDATA[<p><i>I just received this comment via private email from an architect on the search team within Microsoft, who has asked to remain anonymous, but who has some very interesting ideas:</i></p>
<p>It will take some time to digest this article. I will comment on one point right now, however:</p>
<blockquote><p>Search engines assume that if many others link to a page then it must be valuable and so rank it higher.</p></blockquote>
<p>This is way off base. Ranking is an extremely complex process, taking into account a wide range of variables, including inbound links, meta data, heuristics, the quality and nature of inbound links (just speculating, a link from Wikipedia might rank very highly compared to other links), and so on. There are also algorithms to detect bot-generated content. Query logs (what people search for and what they click (and whether they stay there or quickly come back)) are also bountiful sources of ranking algorithms. And these are just the obvious things that the big engines figured out years ago.</p>
<p>The most important challenge &#8212; and competitive playing field &#8212; is capacity, and not just to hold the web, but to compute on it. We hold tens of billions of documents, so even URI-only graph algorithms have to be distributed. That&#8217;s why Google and Microsoft (and all the other transnats) are buying land near power stations as fast as they can. The computational platform itself, is also a key strategic piece. Whoever can more elegantly express and efficiently execute computations over petabytes of data has a significant competitive edge.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Andrew Matthews</title>
		<link>http://aabs.wordpress.com/2007/09/04/the-future-of-search/#comment-14563</link>
		<dc:creator>Andrew Matthews</dc:creator>
		<pubDate>Wed, 05 Sep 2007 23:31:36 +0000</pubDate>
		<guid isPermaLink="false">http://aabs.wordpress.com/2007/09/04/the-future-of-search/#comment-14563</guid>
		<description>This article can also be found at http://blog.vortexdna.com/andrew-matthews-on-the-future-of-search/</description>
		<content:encoded><![CDATA[<p>This article can also be found at <a href="http://blog.vortexdna.com/andrew-matthews-on-the-future-of-search/" rel="nofollow">http://blog.vortexdna.com/andrew-matthews-on-the-future-of-search/</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mills Davis</title>
		<link>http://aabs.wordpress.com/2007/09/04/the-future-of-search/#comment-14548</link>
		<dc:creator>Mills Davis</dc:creator>
		<pubDate>Wed, 05 Sep 2007 12:51:58 +0000</pubDate>
		<guid isPermaLink="false">http://aabs.wordpress.com/2007/09/04/the-future-of-search/#comment-14548</guid>
		<description>Currently, there are a number of ventures and product development efforts focused on adding linguistics to search (moving beyond keywords), semantic search (combining knowledge models with linguistics in the search),  vertical search (finding a particular kind of information or answer in a particular domain), media search (being able to search video, and other kinds of file formats) and so on.   There is also a trend towards question answering and what I would call semantic research rather than search that involves training on questions, briefs or documents on the fly to then read and extract specific information of interest and to then write and report syntheses of findings.</description>
		<content:encoded><![CDATA[<p>Currently, there are a number of ventures and product development efforts focused on adding linguistics to search (moving beyond keywords), semantic search (combining knowledge models with linguistics in the search),  vertical search (finding a particular kind of information or answer in a particular domain), media search (being able to search video, and other kinds of file formats) and so on.   There is also a trend towards question answering and what I would call semantic research rather than search that involves training on questions, briefs or documents on the fly to then read and extract specific information of interest and to then write and report syntheses of findings.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Andrew Matthews</title>
		<link>http://aabs.wordpress.com/2007/09/04/the-future-of-search/#comment-14529</link>
		<dc:creator>Andrew Matthews</dc:creator>
		<pubDate>Wed, 05 Sep 2007 00:08:30 +0000</pubDate>
		<guid isPermaLink="false">http://aabs.wordpress.com/2007/09/04/the-future-of-search/#comment-14529</guid>
		<description>Hi Sudha,

I'm an optimist - I don't think that there is a problem that we can't overcome, given sufficient incentive.</description>
		<content:encoded><![CDATA[<p>Hi Sudha,</p>
<p>I&#8217;m an optimist - I don&#8217;t think that there is a problem that we can&#8217;t overcome, given sufficient incentive.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: sudha</title>
		<link>http://aabs.wordpress.com/2007/09/04/the-future-of-search/#comment-14526</link>
		<dc:creator>sudha</dc:creator>
		<pubDate>Tue, 04 Sep 2007 23:50:32 +0000</pubDate>
		<guid isPermaLink="false">http://aabs.wordpress.com/2007/09/04/the-future-of-search/#comment-14526</guid>
		<description>Wow, great insights into what lies ahead in future of search. Do we have more issues than we can solve?</description>
		<content:encoded><![CDATA[<p>Wow, great insights into what lies ahead in future of search. Do we have more issues than we can solve?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Andrew Matthews</title>
		<link>http://aabs.wordpress.com/2007/09/04/the-future-of-search/#comment-14496</link>
		<dc:creator>Andrew Matthews</dc:creator>
		<pubDate>Tue, 04 Sep 2007 13:42:46 +0000</pubDate>
		<guid isPermaLink="false">http://aabs.wordpress.com/2007/09/04/the-future-of-search/#comment-14496</guid>
		<description>This article was invited by Kaila at http://blog.vortexdna.com as party of their 'Dream team of Search' series. And is posted there too.</description>
		<content:encoded><![CDATA[<p>This article was invited by Kaila at <a href="http://blog.vortexdna.com" rel="nofollow">http://blog.vortexdna.com</a> as party of their &#8216;Dream team of Search&#8217; series. And is posted there too.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
