<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Why We Need Open Search, and How to Make Money Doing It</title>
	<atom:link href="http://jonathanstray.com/why-we-need-open-search/feed" rel="self" type="application/rss+xml" />
	<link>http://jonathanstray.com/why-we-need-open-search</link>
	<description>Information, Culture, and Belief</description>
	<lastBuildDate>Tue, 24 Aug 2010 11:34:03 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: admin</title>
		<link>http://jonathanstray.com/why-we-need-open-search/comment-page-1#comment-599</link>
		<dc:creator>admin</dc:creator>
		<pubDate>Tue, 29 Sep 2009 02:24:10 +0000</pubDate>
		<guid isPermaLink="false">http://jonathanstray.com/?p=1023#comment-599</guid>
		<description>80legs may doing exactly what I want! Thanks so much, Neil!

Hmm... what do I want to build first?</description>
		<content:encoded><![CDATA[<p>80legs may doing exactly what I want! Thanks so much, Neil!</p>
<p>Hmm&#8230; what do I want to build first?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Neil Kandalgaonkar</title>
		<link>http://jonathanstray.com/why-we-need-open-search/comment-page-1#comment-598</link>
		<dc:creator>Neil Kandalgaonkar</dc:creator>
		<pubDate>Mon, 28 Sep 2009 23:06:03 +0000</pubDate>
		<guid isPermaLink="false">http://jonathanstray.com/?p=1023#comment-598</guid>
		<description>There&#039;s also 80Legs, a new startup which essentially rents crawling infrastructure at a relatively low fee. And another new startup, Spinn3r, tries to keep up with blog posts and Facebook activity in near-real-time.

Finally, there&#039;s archive.org. I interviewed there -- didn&#039;t work out, but I learned a lot about them. Until now they&#039;ve really seen their role as, well, an archive. They have a good amount of data, but they think about preserving web pages for the next 100 years, not opening it up to interactive experiments. Still it&#039;s a potential source of data...

Of course none of these are Google. One of the reasons why Google is miles ahead of the competition is that they&#039;ve worked out ways to make experiments on the dataset extremely cheap, at least internally. It&#039;s not impossible that they&#039;ll allow outsiders access to that for a fee, at some point.

But that is still nothing like a true public archive.</description>
		<content:encoded><![CDATA[<p>There&#8217;s also 80Legs, a new startup which essentially rents crawling infrastructure at a relatively low fee. And another new startup, Spinn3r, tries to keep up with blog posts and Facebook activity in near-real-time.</p>
<p>Finally, there&#8217;s archive.org. I interviewed there &#8212; didn&#8217;t work out, but I learned a lot about them. Until now they&#8217;ve really seen their role as, well, an archive. They have a good amount of data, but they think about preserving web pages for the next 100 years, not opening it up to interactive experiments. Still it&#8217;s a potential source of data&#8230;</p>
<p>Of course none of these are Google. One of the reasons why Google is miles ahead of the competition is that they&#8217;ve worked out ways to make experiments on the dataset extremely cheap, at least internally. It&#8217;s not impossible that they&#8217;ll allow outsiders access to that for a fee, at some point.</p>
<p>But that is still nothing like a true public archive.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
