By: Jonathan Stray » We Have No Maps of The Web

Jonathan Stray » We Have No Maps of The Web — Wed, 17 Aug 2011 04:16:11 +0000

[…] job — if they wanted to, or if they were willing to let others access their data. (Update: more on the economics of web indices.) But details follow need; like Stewart Brand, maybe we first need […]

By: admin

admin — Tue, 29 Sep 2009 02:24:10 +0000

80legs may doing exactly what I want! Thanks so much, Neil!

Hmm… what do I want to build first?

By: Neil Kandalgaonkar

Neil Kandalgaonkar — Mon, 28 Sep 2009 23:06:03 +0000

There’s also 80Legs, a new startup which essentially rents crawling infrastructure at a relatively low fee. And another new startup, Spinn3r, tries to keep up with blog posts and Facebook activity in near-real-time.

Finally, there’s archive.org. I interviewed there — didn’t work out, but I learned a lot about them. Until now they’ve really seen their role as, well, an archive. They have a good amount of data, but they think about preserving web pages for the next 100 years, not opening it up to interactive experiments. Still it’s a potential source of data…

Of course none of these are Google. One of the reasons why Google is miles ahead of the competition is that they’ve worked out ways to make experiments on the dataset extremely cheap, at least internally. It’s not impossible that they’ll allow outsiders access to that for a fee, at some point.

But that is still nothing like a true public archive.

Comments on: Why We Need Open Search, and How to Make Money Doing It

By: Jonathan Stray » We Have No Maps of The Web

By: admin

By: Neil Kandalgaonkar