<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Jonathan Stray &#187; technology</title>
	<atom:link href="http://jonathanstray.com/tag/technology/feed" rel="self" type="application/rss+xml" />
	<link>http://jonathanstray.com</link>
	<description>Information, Culture, and Belief</description>
	<lastBuildDate>Tue, 15 May 2012 20:13:21 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Learn to program, then and now</title>
		<link>http://jonathanstray.com/learn-to-program-then-and-now</link>
		<comments>http://jonathanstray.com/learn-to-program-then-and-now#comments</comments>
		<pubDate>Wed, 28 Sep 2011 23:42:15 +0000</pubDate>
		<dc:creator>Jonathan Stray</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[computational journalism]]></category>
		<category><![CDATA[learning]]></category>
		<category><![CDATA[systems]]></category>
		<category><![CDATA[technology]]></category>

		<guid isPermaLink="false">http://jonathanstray.com/?p=3223</guid>
		<description><![CDATA[Learning to program a computer is hard. While you can learn to make useful things in a few months, mastery may take a decade. It&#8217;s not like learning to bake a cake or shoot a video. It&#8217;s more like learning to play a musical instrument. It takes years of practice to get really good &#8212; or [...]]]></description>
			<content:encoded><![CDATA[<p>Learning to program a computer is hard. While you can learn to make useful things in <a href="http://lifeandcode.tumblr.com/post/10596183761/learning-to-program-for-journalists-the-epic-howto">a few months</a>, mastery may take a <a href="http://norvig.com/21-days.html">decade</a>. It&#8217;s not like learning to bake a cake or shoot a video. It&#8217;s more like learning to play a musical instrument. It takes years of practice to get really good &#8212; or in the programmer&#8217;s case, tens of thousands of lines of production code. Meanwhile, you&#8217;re going to make the neighbors&#8217; ears bleed.</p>
<p>Why would anyone do this? I think the reasons people invest such insane amounts of time in such a specialized skill are shifting. And I think that shift is healthy. It&#8217;s a shift in what it means to be a technologist. And the culture of our technical communities is shifting with it.</p>
<p><strong>Back then</strong><br />
I learned to program in high school, early 90s. Looking back, I think my formative experiences as a technologist were pretty typical for my generation of programmers. I had three or four close friends in high school who also liked computers. They were all male. This was the dawn of the internet, around the time of the very first web browsers, and long before the first tech bubble made geeks into rich visionaries. We were not remotely cool. Technical information was somewhat harder to come by than today, but not excessively so. My first C++ compiler shipped in a big box which included a thick paper reference manual and a really nice language tutorial. We subscribed to Byte and Doctor Dobbs&#8217; Journal. We hacked on stuff at lunch time and after school and weekends, and traded shareware on floppies. The technology was different, but the substance of the experience was much the same as today. We spent a lot of time at the computer, and we were well-connected into a community of like minded people. The community provided technical help but also motivation and inspiration.</p>
<p>We weren&#8217;t trying to change the world.  We were driven by an intense curiosity about the inner workings of machines, and we wanted to be admired for being good at something. I wrote the Windows port of <a href="http://en.wikipedia.org/wiki/Netrek">Netrek</a>, one of the very first multiplayer online games, and the local geeks knew who I was when I arrived at the University of Toronto. This kind of experience persisted through my undergraduate years studying computer science. Long nights in the computer lab; cool hacks. There&#8217;s a wonderful <a href="http://en.wikipedia.org/wiki/Hackers:_Heroes_of_the_Computer_Revolution">book</a> which captures this culture as it evolved starting in the late 1950s.</p>
<p><strong>Enter women</strong><br />
There were no women in the communities where I learned to program. Or, almost none. I did a head count in one of my classes: four out of 150 students. Sadly, this kind of ratio persists <a href="http://twitter.com/#!/jonathanstray/status/114474779989377026">today</a> in many technical fields. I didn&#8217;t really know why this was. Us nerdy boys would have welcomed geeky girls. For all sorts of the right and wrong reasons.</p>
<p>It&#8217;s only in the last few years that I&#8217;ve started to understand why the dominant nerd culture drove women away in droves. Simply put: it was a club full of very poorly socialized boys, and our peer-based motivation was all about status. We all wanted to be the alpha geek. We would jump all over each other to point out errors. We would never miss a chance to demonstrate our superior, elegant technical minds. We were completely insufferable to anyone else.</p>
<p>Fortunately, there are now more women in tech. And they&#8217;re starting to tell their tale. While I don&#8217;t want to generalize too much from the experiences of a single person, I found the account of Rebekah Cox to be really enlightening (there are lots more great stories in the same <a href="http://www.quora.com/What-is-it-like-to-be-a-woman-working-in-the-tech-industry">thread</a>):</p>
<blockquote><p>So, if you enter this environment as a woman without any sort of agenda or understanding of this culture the first thing you find is that if you actually say something the most likely reaction is for a guy to verbally hit you directly in the face. To the guys this is perfectly normal, expected and encouraged behavior but to women this is completely out of nowhere and extremely discouraging.</p>
<p>As a technical woman, this is your introduction and the first thing you have to learn is how to get back up and walk right back into a situation where the likelihood of getting punished for participating is one. How you choose to react to this determines the rest of your career in technology.</p></blockquote>
<p>Now, I don&#8217;t want to give the wrong impression. It wasn&#8217;t all one-upmanship and verbal assaults. These geek scenes could also be wonderfully supportive, and often served as social groups too. You have to remember that this was before computers were cool, and it was an awkward adolescence when you were interested in things you couldn&#8217;t begin explain to anyone else. Also, it was a great learning environment. Cox again:</p>
<blockquote><p>Even the aforementioned nerd trash talk is actually a useful tool that can help you. The reason that culture exists is to make everyone in the group better. The fact that you are getting hit in the face means that someone is either wrong and you can hit back with a correct answer or that you are wrong and someone is letting you know that directly. Sticking that out means you are learning in an accelerated environment with instant correction.</p>
<p>Furthermore, if you stick around long enough, you can find people who aren&#8217;t completely insecure and are confident enough to not resort to insults to assert themselves. Those people make the tough environment actually tolerable. If you can help each other then you can establish a safer zone to talk through ideas. And since those more secure people are typically so secure because they are really, really good, you can find yourself in an informational jet-stream.</p></blockquote>
<p>In this artificial high-pressure environment we got good fast. But it was certainly off-putting to women, and not just women. Lots and lots of people wanted no part of this, and for good reason. Yet for quite a long time it was these sorts of socially dysfunctional communities that produced the lion&#8217;s share of the best technologists.</p>
<p><strong>Why program?</strong><br />
Learning to program is still ridiculously hard, and still requires a community of practice. And it still requires an absurd focus and motivation. But the sources of that motivation are broadening. I&#8217;ve been watching this shift for a while. The notion of programming for the social good has even crystalized into institutions such as <a href="http://www.rhok.org/">Random Hacks of Kindness</a> (for international development), <a href="http://hackshackers.com/">Hacks/Hackers</a> (for journalists), and <a href="http://codeforamerica.org/">Code for Amercia</a> (civic platforms.) For that matter, there&#8217;s Wikipedia. There are services and data all over the web. We don&#8217;t have to wonder whether software can change the world &#8212; it already has!</p>
<p>So by my old-school standards, the burgeoning hackers of today are very <em>applied</em>. I grew up desperately curious about the insides of things. Many of the programmers getting started now are far more extroverted than that. Here&#8217;s MIT Media Lab fellow <a href="http://lifeandcode.tumblr.com/post/10034972791/code-to-make-a-point-code-to-make-change">Lisa Williams</a>:</p>
<blockquote><p><em><strong>I want to learn to code because a lot of things piss me off.  </strong></em></p>
<p>I believe a program can stand in opposition to Things That Suck, just like a documentary, a work of art, or a protest march.</p>
<p>&#8230;</p>
<p>I wanna code because <strong>SHIT IS BROKEN.</strong>  I want to code because <a href="http://sunfoundation.tumblr.com/post/9887265525/the-supercommittees-k-street-connections-nearly">corruption is real</a>, <a href="http://www.businessinsider.com/satellite-tour-foreclosure-cities-2011-1?slop=1#boise-idaho-1-in-21-homes-in-foreclosure-the-red-dots-show-foreclosures-1">because people are getting thrown out of their houses</a>, because veterans aren’t getting what they deserve, because <a href="http://www.healthbeatblog.org/2007/10/race-and-health.html">racism is real and has real effects</a>, because <a href="http://transport.kurtraschke.com/2011/05/gtfs-visualize-service">yes it does matter when you cancel a bus line</a>, <a href="http://sunfoundation.tumblr.com/post/9922499027/ilovecharts-ben-greenman-via-mcsweeneys">because it’s really hard to shut a computer program up</a>, because <a href="http://www.palmbeachpost.com/news/map-acreage-cancer-cluster-confirmed-211098.html">you can’t say it’s an isolated incident when there’s a bigass Google Map in your face showing you it’s not</a>.</p></blockquote>
<p>This is Lisa demanding &#8220;<a href="http://jonathanstray.com/a-computational-journalism-reading-list">computational journalism</a>.&#8221; But pretty much every field of human endeavor uses lots and lots of software now. Software not only determines what is possible, in many ways, but what is not possible: <a href="http://harvardmagazine.com/2000/01/code-is-law.html">code-as-law</a>. It&#8217;s part of the system, and if you want to hack the system, well, at some point someone has to hack the code. That person could be you.</p>
<p><strong>Today</strong><br />
At the Online News Association conference last week, I ran into Michelle Minkoff and Heather Billings standing in front of couple dozen enthusiastic young journalists who had gathered in the hallway to hear about programming. Michelle works with me in the Interactive department at the AP, while Heather just started at the Chicago Tribune. Both are fearsome technologists, though I don&#8217;t think either would be offended if I said they are still near the beginning of their journey. That makes them the perfect people to talk to about learning to program.</p>
<p>Most of the people attending had some programming experience, but not much. There were 24 people listening to Michelle and Heather, 9 of whom were female. A great improvement. I sat in on this conversation for a while. It wasn&#8217;t what I was expecting. No code. Very little technical discussion at all actually.  One woman said she knew enough Python to write a Hangman game. &#8220;Great!&#8221; said Michelle. &#8220;You&#8217;re ready to learn <a href="https://www.djangoproject.com/">Django</a>!&#8221;</p>
<p>I guess I&#8217;m surprised anyone has to be told that they are ready to learn to program. But inclusion and connection was a major theme in the discussion. Here are some of the snippets of conversation I wrote down:</p>
<blockquote><p>&#8220;You can make an anonymous account on StackOverflow and ask stupid questions.&#8221;</p>
<p>&#8220;Connect in person, build that mentor relationship.&#8221;</p>
<p>&#8220;But that documentation is for developers!&#8221;</p></blockquote>
<p>This was a group of people who needed to be told that they could learn to program. That they could be <em>one of them</em>. This is understandable. When you can&#8217;t begin decipher the supposed &#8220;instructions,&#8221; technology can seem like an occult priesthood. But you don&#8217;t need <em>them</em>. You just need to want to do it, really badly, and you need to find some other people who want to do it badly too (and obviously, expect to meet these people online.) Then <em>one of them</em> becomes <em>one of us</em>. Of course you can learn to program. It just takes a while, and a stupid amount of practice.</p>
<p>In fact it&#8217;s probably necessary to devote a few years of your life to it full time. That&#8217;s one of the advantages of a computer science degree &#8212; time to focus. Also, a CS degree is a fast track to the deep theory of computation; if you find yourself looking at programming languages and asking why they are the way they are, or staring hungrily across the awesome gap between your web apps and a search engine, you probably want to learn computer science, and formal education is one way to do that. But CS theory won&#8217;t make you a programmer. Only programming will do that.</p>
<p>Every truly good programmer I have known had some period of their life where they thought of nothing but code. Something around a year or two. It&#8217;s got to get under your skin at some point. I call this the hacker gestation period. You&#8217;ll know you&#8217;ve reached the other side of it, because software will stop being mysterious. Eventually code becomes clay.</p>
<p>And this formative period is why it&#8217;s so important to have a community. You&#8217;re going to need friends who are interested in talking about geeky stuff. You&#8217;ll be so excited about it for a while that you won&#8217;t be able to talk about much else. (Really. If this is not the case, go do something else. Programming takes so much soul that you&#8217;re going to hate your life if you don&#8217;t genuinely enjoy it.) Your community will help you when you get stuck, and they will help you develop your sense of style. Code is the most obscure art, because only another programmer can see all the layers of beauty in a truly lovely piece of code. But it&#8217;s very hard to become an artist alone, without influences and critics.</p>
<p>So it takes a village to make a programmer. I won&#8217;t say that our technical villages are now inhabited by &#8220;normal&#8221; people, by any stretch of the imagination, but the communities where programmers are now growing up seem far more diverse, supportive, and extroverted than in years past.</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://jonathanstray.com/learn-to-program-then-and-now/feed</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>The world cannot be represented in machine-readable form</title>
		<link>http://jonathanstray.com/the-world-cannot-be-represented-in-machine-readable-form</link>
		<comments>http://jonathanstray.com/the-world-cannot-be-represented-in-machine-readable-form#comments</comments>
		<pubDate>Thu, 15 Apr 2010 08:54:41 +0000</pubDate>
		<dc:creator>Jonathan Stray</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[artificial intelligence]]></category>
		<category><![CDATA[information]]></category>
		<category><![CDATA[journalism]]></category>
		<category><![CDATA[knowledge]]></category>
		<category><![CDATA[linked data]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[ontology]]></category>
		<category><![CDATA[technology]]></category>

		<guid isPermaLink="false">http://jonathanstray.com/?p=1789</guid>
		<description><![CDATA[UPDATE: Debrouwere continues the conversation with a response to the key points here, in the comments to his original post. Dutch journalist/coder Stijn Debrouwere has written a very thorough post describing the ways in which standard tags, like the ones on this blog or on Flickr, fall short when applied to news articles. There are lots of [...]]]></description>
			<content:encoded><![CDATA[<p><strong>UPDATE: </strong>Debrouwere continues the conversation with a response to the key points here, in the comments to his original <a href="http://stdout.be/2010/tags-dont-cut-it">post</a>.</p>
<p>Dutch journalist/coder Stijn Debrouwere has written a very thorough <a href="http://stdout.be/2010/tags-dont-cut-it">post</a> describing the ways in which standard tags, like the ones on this blog or on Flickr, fall short when applied to news articles. There are lots of things we might like to know about a story, such as where and when it happened and who was involved. This additional information, sort of like the index to a book, is known as &#8220;metadata&#8221;, and there is within the online journalism community a great call for its use, including by Debrouwere:</p>
<blockquote><p>Each story could function as part of a web of knowledge around a certain topic, but it doesn’t.</p>
<p>So here’s a well-intentioned idea you’ve heard before: journalists should start tagging. <a href="http://twitter.com/jayrosen_nyu">Jay Rosen</a> insists that “<a href="http://journalism.nyu.edu/pubzone/weblogs/pressthink/2010/03/07/what_i_plan_to.html">Getting disciplined and strategic about tagging</a>&#8221; may be one way professional journalism separates itself from the flood of cheap content online.” Tags can show how a news article relates to broader themes and topics. Just the ticket.</p></blockquote>
<p>News metadata is a major topic, and many people have speculated deeply about the value of creating news metadata at the time of reporting, such as the ever-sarcastic <a href="http://xark.typepad.com/my_weblog/2009/05/the-lack-of-vision-thing-well-heres-a-vision-for-you.html">Xark</a> and the thoughtful Martin Belam who writes about <a href="http://www.guardian.co.uk/help/insideguardian/2010/jan/25/news-linked-data-summit">why &#8220;linked data&#8221; is good for journalism</a>. But I&#8217;m going to respond to Debrouwere because I read him today, because he has lovely diagrams that explain his good ideas, and because, in criticizing &#8220;tags&#8221; as a form of metadata, I think he misses some very important points.</p>
<p>And he&#8217;s not alone. My sense is that many of the coder-journalists of today have not learned from the mistakes of generations of technically-minded people who wished to talk about the world in more precise ways.</p>
<p>Moving forward from simple tagging, Debrouwere imagines more sophisticated annotation schemes that start to pick up on what the tags actually mean. For starters, the tags could be drawn from separate &#8220;vocabularies.&#8221; Does a tag refer to a person, or a place, or perhaps an event? Debrouwere uses the following picture, which I&#8217;m going to borrow here because it explains the idea so nicely:</p>
<p style="text-align: center;"><a href="http://stdout.be/2010/tags-dont-cut-it/"><img class="size-full wp-image-1791   aligncenter" title="5-types-of-relationships" src="http://jonathanstray.com/wp-content/uploads/2010/04/5-types-of-relationships.png" alt="5-types-of-relationships" width="520" height="477" /></a></p>
<p>But, he says, we can get even more sophisticated. What did the story actually say? If it mentioned a person, what did it say about them? Was it an interview? A profile? Did it criticize them? Here&#8217;s the diagram he draws:<span id="more-1789"></span></p>
<p style="text-align: center;"><a href="http://jonathanstray.com/wp-content/uploads/2010/04/relationship-to-person1.png"><img class="size-full wp-image-1790 aligncenter" title="relationship-to-person1" src="http://jonathanstray.com/wp-content/uploads/2010/04/relationship-to-person1.png" alt="relationship-to-person1" width="520" height="271" /></a></p>
<p>He imagines using this information to perform chains of inferences, like so:</p>
<blockquote><p>Barack Obama belongs to the Democratic Party and he’s from Chicago. If we tag an article with Barack Obama, it’s likely that the article also has something to do with the Democratic Party. If we’ve specified that the article is about Obama, and we’ve specified that Obama is part of the DP, the system now has all the necessary information to suggest our article about Obama as a possibly interesting related read on the topical page for the democratic party, even if we didn’t explicitly indicate that link.</p></blockquote>
<p>First of all, note that this sort of thing is already possible, quite often, using tags as they exist today. Simple analysis of co-tagging information will tell us that Obama is related to the Democratic party, because many articles will be tagged with both. Which is not to say that encoding such relationships explicitly isn&#8217;t a good idea. We can do this sort of thing using &#8220;triples,&#8221; which are fundamental to the nascent evolution of the internet into a <a href="http://blog.ted.com/2009/03/tim_berners_lee_web.php">web of &#8220;linked data&#8221;</a>:</p>
<ul>
<li>
<pre>&lt;Barack Obama&gt; belongs-to-party &lt;Democratic Party&gt;</pre>
</li>
</ul>
<p>Here, &#8220;Barack Obama&#8221; is an object from, say, the &#8220;people&#8221; vocabulary, and &#8220;Democratic Party&#8221; is from, perhaps, the &#8220;political party&#8221; vocabulary, or maybe just from &#8220;groups.&#8221; Essentially, these are tags that have been pre-categorized. The relationship between the two is expressed by the &#8220;belongs-to-party&#8221; <a href="http://www.cs.odu.edu/~toida/nerzic/content/logic/pred_logic/predicate/pred_intro.html">predicate</a>.</p>
<p>But I argue that this is a rigged example. The world is normally much more messy.</p>
<p>&#8220;Are you now or have you ever been a member of the communist party?&#8221; was a killer question <a href="http://en.wikipedia.org/wiki/Mccarthyism">in its day</a>, with complex answers like &#8220;I only attended one meeting.&#8221; And if parsing politician&#8217;s statements was easy, then <a href="http://www.politifact.com/">Politifact</a> wouldn&#8217;t devote entire articles to the question of whether a single sentence was true or false. Further, they distinguish between different &#8220;grades&#8221; of truth, like &#8220;<a href="http://www.politifact.com/truth-o-meter/statements/2010/apr/12/robert-gates/gates-said-leaked-military-video-shooting-iraq-doe/">mostly true</a>.&#8221; Mathematical logic &#8212; which is what the sort of news inferences that Debrouwere and others discuss is based on &#8212; doesn&#8217;t deal with &#8220;mostly true.&#8221;</p>
<p>The problem is that the world is not neatly categorizable.</p>
<p>Don&#8217;t get me wrong &#8212; vocabularies and relationships (ala linked data triples) are surely a good idea. But they have some serious drawbacks that relate to very deep issues in knowledge representation.</p>
<p>Debrouwere says, &#8220;Events happen at a certain place and at a certain time.&#8221; Sometimes. For a house fire or a shooting, maybe, but how &#8220;long&#8221; were the post-election protests in Iran last summer? They continued at varying intensity for several days, then flared up weeks later. Was that one protest or two? And what about a Facebook protest that gathered supporters over the course of a week? &#8220;When&#8221; and &#8220;where&#8221; did that happen?</p>
<p>Or, take the example of describing what an article says about someone. How do we decide when a story &#8220;criticizes&#8221; someone? There will always be boundary cases &#8212; lots of them in professional reporting. How do we ensure <a href="http://en.wikipedia.org/wiki/Inter-rater_reliability">inter-rater reliability</a>? Can we extract any real data from analyses of this tag if we have no other reference points with which to interpret it?</p>
<p>Something is always lost in categorization. That is the point! To say that two things are <em>like</em> one another is to ignore their differences, for the purposes of the present discussion. Unfortunately, what can safely be ignored depends on the discussion. Simple date and place notations work for some purposes, and fail miserably for others. They are not very rich, and even worse, we don&#8217;t know exactly how much has been lost in each case. Knowledge of that error is sometimes critical, especially when trying to make chains of inferences, where errors multiply.</p>
<p>The reason we use text for reporting is that it&#8217;s good at representing these sorts of ambiguities. Strict adherence to the religion of finite relationship vocabularies leads one to believe that the world can be modeled in <a href="http://en.wikipedia.org/wiki/First_order_logic">first-order logic</a> (predicate logic), and this just isn&#8217;t true. Chains of automatic inference fail very quickly when applied even to very simple &#8220;real world&#8221; situations. The Artificial Intelligence research community went down <a href="http://en.wikipedia.org/wiki/GOFAI">that road</a> for decades and found it really problematic, which is why we&#8217;re now seeing the rise of &#8220;statistical&#8221; AI techniques, such as <a href="http://en.wikipedia.org/wiki/Statistical_machine_translation">statistical machine translation</a>. This approach tries to find patterns in vast amounts of data rather than working out hard underlying rules; the categorization comes after you look at all available data, not before.</p>
<p>And therein lies the great virtue of tags: they are just about the simplest possible way of saying something, and don&#8217;t imply or require any particular inferential framework. They&#8217;re much harder to get wrong than more complex associations, and they make sense only in aggregate, and this makes them much more robust than predicate sentences. A tag says only, &#8220;there&#8217;s some association.&#8221; Full stop. I find this ambiguity a virtue. The meaning comes out of the relationships between the tags, articles, and users. Meaning is always relative, and tags force us to understand this, because there&#8217;s nothing else to go on.</p>
<p>Tags allow (or force) what we might call the &#8220;Google solution&#8221;: let humans describe it in a way that makes sense to them, then sort it all out later algorithmically. There are limits to this, of course, which is why metadata has value. But ultimately, computers serve humans, so the Google solution will always be a win when it is possible.</p>
<p>Linked data will be valuable because of the links. I predict that its main use will be as a sort of &#8220;super tagging&#8221; system: we still have &#8220;tags&#8221; in the linked data world,  it&#8217;s just that they&#8217;re now all &#8220;<a href="http://en.wikipedia.org/wiki/Uniform_Resource_Identifier">uniform resource identifiers</a>&#8221; that are visible to everyone on the web. This means that tags can be shared between systems and maintained by communities, which only makes them more powerful. In fact, this is exactly what we&#8217;re already seeing, with the Wikipedia-derived <a href="http://en.wikipedia.org/wiki/Dbpedia">DBPepdia</a> at the center of all those linked data &#8220;<a href="http://www4.wiwiss.fu-berlin.de/bizer/pub/lod-datasets_2009-02-27.png">bubble diagrams</a>.&#8221;</p>
<p>Linked data also supports predicates that  say what the relationship between the tags is, like the &#8220;Barack Obama is a member of the Democratic Party&#8221; example. But I predict that these will  be much less useful, offering almost none of the &#8220;machine understanding&#8221; that&#8217;s supposed to come with the semantic web. I don&#8217;t know what &#8220;understanding&#8221; means if not the ability to draw inferences of some sort, and predicates are just too fragile, too subject to mis-categorization, too limited to capture the rich relationships of the real world. I do believe that we&#8217;ll see amazing new &#8220;artificial intelligence&#8221;-like applications built on top of linked data, but they&#8217;ll be built statistically: they&#8217;ll ignore the predicates or use them only in special cases, or only in aggregate.</p>
<p>Having said all this, I am fully in support of adding better metadata to news stories. I believe the  &#8221;entity recognition&#8221; performed by <a href="http://www.opencalais.com/">OpenCalais</a> is valuable, and that carefully managed tag vocabularies are essential. Often &#8220;location&#8221; will be a genuinely useful tag, and I can see the possibility for some <a href="http://www.suburbified.com/">wonderful news mashups</a>.</p>
<p>But please, let&#8217;s not imagine that we can capture even the &#8220;essential&#8221; details of real journalism with any fixed vocabulary. And let&#8217;s not oversell the potential of machine reasoning or data-mining based on carefully-annotated news metadata.</p>
<p>We&#8217;re a very long way from understanding how to represent reality in machine-readable form.</p>
<p>For more on this topic, I recommend:</p>
<ul>
<li>&#8220;<a href="http://www.shirky.com/writings/ontology_overrated.html ">Ontology is overrated</a>&#8221; by Clay Shirky</li>
<li>&#8220;<a href="http://www.well.com/~doctorow/metacrap.htm">Metacrap</a>&#8221; by Cory Doctorow</li>
<li>&#8220;<a href="http://groups.csail.mit.edu/medg/ftp/psz/k-rep.html">What is a knowledge representation?</a>&#8221; by  Davis et al. at MIT</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://jonathanstray.com/the-world-cannot-be-represented-in-machine-readable-form/feed</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>When you have to freeze your phone, technology is too complex</title>
		<link>http://jonathanstray.com/technology-is-way-too-complex-28</link>
		<comments>http://jonathanstray.com/technology-is-way-too-complex-28#comments</comments>
		<pubDate>Mon, 15 Feb 2010 17:20:46 +0000</pubDate>
		<dc:creator>Jonathan Stray</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[frustration]]></category>
		<category><![CDATA[iphone]]></category>
		<category><![CDATA[technology]]></category>

		<guid isPermaLink="false">http://jonathanstray.com/?p=1601</guid>
		<description><![CDATA[My iPhone seemed to work better on ice, so I spent the last two hours alternating between chilling it in the freezer and pressing buttons. The WiFi kept cutting out, and I read somewhere that one of the failure modes for the iPhone radio was thermal. Amazingly enough, it worked, and the WiFi would run [...]]]></description>
			<content:encoded><![CDATA[<p>My iPhone seemed to work better on ice, so I spent the last two hours alternating between chilling it in the freezer and pressing buttons. The WiFi kept cutting out, and I read somewhere that one of the failure modes for the iPhone radio was thermal. Amazingly enough, it worked, and the WiFi would run for maybe three minutes after ten minutes of chilling. I desperately needed it to work, because my 3G service was down until I could install Ultrasn0w, the iPhone unlocking software. Which can only be installed by a program called Cydia, which only downloads new software over a WiFi network. I have to use unlocking software in the first place because US model iPhones are keyed to work only with AT&#038;T, which doesn&#8217;t exist in Hong Kong. I successfully unlocked my phone months ago, and everything was working fine until I upgraded the firmware, which I did in the hopes of fixing the WiFi which failed last week.</p>
<p>If you didn&#8217;t follow that, consider yourself fortunate. You&#8217;ve never needed to wonder about such things.</p>
<p>It gets better. When I reset my phone it lost the WiFi password to my home network. I couldn&#8217;t find it written down. I couldn&#8217;t remember the password to log into my router to look it up. The internet told me how to reset the router at the hardware level, but to reconfigure the wireless I&#8217;d need to connect my laptop to it with a cable. Which I didn&#8217;t have. Luckily, I eventually remembered the router password.</p>
<p>I started drinking.</p>
<p>Password problem solved, every ten minutes I&#8217;d open the freezer door, reset the WiFi on my phone, wait for Cydia to download its package list, then tell it to download the mere 50kb of Ultrasn0w and hope to hell the radio didn&#8217;t blink out in the middle of the tiny transfer. Now I know exactly how many bars I get in the back of the freezer.</p>
<p>After eight or nine tries,  I opened the freezer door to find my phone on the 3G network. Success! </p>
<p>Actually, it was way more involved than this. I left out a bunch of steps, all the things I tried that didn&#8217;t work. And of course the firmware upgrade did not fix the WiFi, so this experiment put me right back were I started and wasted six hours of my life and two tumblers of rather nice whiskey. At least I didn&#8217;t have to go out of my way to retrieve the ice. </p>
]]></content:encoded>
			<wfw:commentRss>http://jonathanstray.com/technology-is-way-too-complex-28/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>iPhone Augmented Reality Arrives &#8212; But When Will We Make Art With It?</title>
		<link>http://jonathanstray.com/but-when-will-they-make-art-with-it</link>
		<comments>http://jonathanstray.com/but-when-will-they-make-art-with-it#comments</comments>
		<pubDate>Mon, 05 Oct 2009 17:03:27 +0000</pubDate>
		<dc:creator>Jonathan Stray</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[art]]></category>
		<category><![CDATA[culture]]></category>
		<category><![CDATA[iphone]]></category>
		<category><![CDATA[technology]]></category>

		<guid isPermaLink="false">http://jonathanstray.com/?p=1068</guid>
		<description><![CDATA[Last year I imagined an iPhone app that superimposed virtual objects over video from the phone&#8217;s camera. With the advent of the iPhone 3GS and its built-in compass, it&#8217;s now happening. This video shows NearestWiki, which tags nearby landmarks/objects and guides you to them. I am aware of a few other AR apps, as this [...]]]></description>
			<content:encoded><![CDATA[<p>Last year I <a href="http://jonathanstray.com/cyberspace-is-everting">imagined</a> an iPhone app that superimposed virtual objects over video from the phone&#8217;s camera. With the advent of the iPhone 3GS and its built-in compass, it&#8217;s now happening.</p>
<p style="text-align: center;"><object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="560" height="340" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowFullScreen" value="true" /><param name="allowscriptaccess" value="always" /><param name="src" value="http://www.youtube.com/v/SCm9PZfT-YY&amp;hl=en&amp;fs=1&amp;" /><param name="allowfullscreen" value="true" /><embed type="application/x-shockwave-flash" width="560" height="340" src="http://www.youtube.com/v/SCm9PZfT-YY&amp;hl=en&amp;fs=1&amp;" allowscriptaccess="always" allowfullscreen="true"></embed></object></p>
<p>This video shows NearestWiki, which tags nearby landmarks/objects and guides you to them. I am aware of a few other AR apps, as this <a href="http://mashable.com/2009/08/19/augmented-reality-apps/">post on Mashable</a> and this <a href="http://hosted.ap.org/dynamic/stories/U/US_TEC_AUGMENTED_REALITY?SITE=AP&amp;SECTION=HOME&amp;TEMPLATE=DEFAULT">AP story</a> discuss. Many of these apps do building/object recognition, and one even <a href="http://www.youtube.com/watch?v=tb0pMeg1UN0">recognizes faces</a> and displays a sort of business card. We&#8217;re already seeing annotation with data from Wikipedia, Twitter and Yelp, and I suspect that we&#8217;re going to see these tools get very deep in the very near future, with Wikipedia-style tagging of  the entire history and context of any object.</p>
<p>Just a moment while I get over the fact that the future is already here.</p>
<p>Ok, I&#8217;m properly jaded again. Yeah, it&#8217;s an app platform, and that&#8217;s cool &#8212; but imagine the possibilities for art. Bets on who&#8217;s going to make the first &#8220;alternate reality spyglass&#8221; piece? Bets on how much <a href="http://en.wikipedia.org/wiki/Matthew_Barney">Matthew Barney</a> will sell it for in the app store?</p>
]]></content:encoded>
			<wfw:commentRss>http://jonathanstray.com/but-when-will-they-make-art-with-it/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Self-Replicating Desktop Manufacturing: Dreams and Reality</title>
		<link>http://jonathanstray.com/self-replicating-desktop-manufacturing</link>
		<comments>http://jonathanstray.com/self-replicating-desktop-manufacturing#comments</comments>
		<pubDate>Fri, 31 Jul 2009 16:33:35 +0000</pubDate>
		<dc:creator>Jonathan Stray</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[economics]]></category>
		<category><![CDATA[technology]]></category>

		<guid isPermaLink="false">http://jonathanstray.com/?p=838</guid>
		<description><![CDATA[The ambition of the RepRap project (&#8220;replicating rapid-prototyper&#8221;) is undeniably cool: to design a machine which is essentially a self-replicating 3D printer. By building up objects layer by layer, rapid prototyping technology can be used to manufacture the parts for just about any simple object or machine. It would be like having your own little [...]]]></description>
			<content:encoded><![CDATA[<p>The ambition of the <a href="http://reprap.org/"><span style="color: #4f218d;">RepRap project</span></a> (&#8220;replicating rapid-prototyper&#8221;) is undeniably cool: to design a machine which is essentially a self-replicating 3D printer. By building up objects layer by layer, rapid prototyping technology can be used to manufacture the parts for just about any simple object or machine. It would be like having your own little factory in exactly the same way that having a laser printer is like having your own printing press, except that you can use this little factory to make another factory to give to your friend.</p>
<p>Theoretically, desktop manufacturing technology then spreads exponentially, until everyone can make whatever material objects they need from downloaded plans, for only the cost of feed plastic.</p>
<p>The dream is best explained in this excellent little video:</p>
<p style="text-align: center;"><object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="400" height="320" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowfullscreen" value="true" /><param name="allowscriptaccess" value="always" /><param name="src" value="http://vimeo.com/moogaloop.swf?clip_id=5202148&amp;server=vimeo.com&amp;show_title=1&amp;show_byline=1&amp;show_portrait=0&amp;color=&amp;fullscreen=1" /><embed type="application/x-shockwave-flash" width="400" height="320" src="http://vimeo.com/moogaloop.swf?clip_id=5202148&amp;server=vimeo.com&amp;show_title=1&amp;show_byline=1&amp;show_portrait=0&amp;color=&amp;fullscreen=1" allowscriptaccess="always" allowfullscreen="true"></embed></object></p>
<p>It&#8217;s hard to overstate the fundamental shift that would come with truly widespread desktop manufacturing. Right now all of the objects we use are manufactured somewhere far away and shipped to us, and the designs are expensive and slow to  change. Instead, imagine if everyone had a household appliance, perhaps fed by spools of plastic and metal wire, that could manufacture just about any object from plans downloaded from the internet. It&#8217;s hard to see how private designs could compete with millions of amateur object designers geeking out over their widgets for the benefit of humanity, which means that designs for all the basic desirable objects would be freely available.</p>
<p>Want a new phone? Download the latest Android phone plan from the <a href="http://en.wikipedia.org/wiki/Open_Handset_Alliance">Open Handset Foundation</a>. That&#8217;s cool, but the really cool thing is this: everyone in the world could have one for the price of plastic. More to the point, everyone in the world could have e.g. irrigation pumps, car parts, light switches, medical devices, essentially all the trappings of modern technology.</p>
<p>It is of course debatable whether or not an increase in humanity&#8217;s use of energy-consuming technology is a good idea at this time. However, it seems to me unconscionable to deny it to the world&#8217;s poor just because we got there first. Further, one could also replicate the parts for <a title="The Gasification Experimenter's Kit" href="http://www.gekgasifier.com/">home biomass reactors</a>, electric cars, and other advanced energy devices &#8212; regardless of whether or not anyone can make a profit selling such items commercially.</p>
<p>New versions of the replicator with enhanced production capabilities (now with integrated circuits!) would be designed to be manufacturable using existing models. This means that manufacturing technology would itself spread virally. To bootstrap this, all you need are a few basic self-replicating machines, then the technology passes from friend to friend until the whole world is saturated and capable of producing all future upgrades.</p>
<p>But we are nowhere near that dream. There&#8217;s a lot of promise to desktop manufacturing, but I&#8217;ve come to believe that the RepRap approach is probably not the right one. And I&#8217;m going to try to explain why.</p>
<p><span id="more-838"></span></p>
<p>Back to reality. Today the RepRap team has succeeded in designing and building a cheap 3D printer which prints in plastic only and can produce about 50% of  its own parts. This is a historic event, and should not be underestimated. However, producing a new machine still requires a lot of basic hardware such as metal rods and screws, and also more exotic components such as specific integrated circuits and stepper motors. In RepRap&#8217;s evolutionary analogy, these raw parts (as well as plastic filament feedstock) are the naturally produced &#8220;vitamins&#8221; that the RepRap consumes from its environment in order to reproduce. As time goes on, the team hopes to produce designs for upgraded RepRap machines that can manufacture more of their own parts, and not incidentally the parts for more complex objects too. For example, they hope to be able to deposit metal films with the next generation machine, which would allow the RepRap to produce electrical wiring and basic circuits.</p>
<p>All very lovely, but it&#8217;s time to examine the reality of this technology. Today we have a prototype design, and a vibrant community of people experimenting with and working on self-replicating desktop manufacturing. Good. But we are not by any means on the threshold of a viral explosion of manufacturing capability, because the machines are not self-replicating exponentially as hoped. A very <a href="http://www.3DReplicators.com/cgi-bin/cblog/index.php?/archives/499-After-Darwin-Should-Mendel-be-a-specific-3D-printer-or-a-technology-toolbox.html">insightful post</a> from a site called <a href="http://3DReplicators.com/index.htm">The Clanking Replicator</a> explains the situation:</p>
<blockquote><p>By the second quarter of 2008 Vik Olliver had managed to print a full parts set for a Darwin with his own Darwin machine. Then a very curious and totally unexpected thing happened. Fully 6 months went by before a Darwin replicated again, this time in Canada. By that time, however, by Dr. Bowyer&#8217;s estimate of the population of Darwins was in the low thousands. What had happened?</p>
<p>Basically, Darwin morphed into a fully industrial product. It began with the controller boards being outsourced for production by the Reprap foundation and has culminated with a shippable kit purchasable for US$1,100  requiring little more than the sort of assembly you&#8217;d be expected to apply to something bought from Ikea. What is getting built out there in its thousands, to use Dr. Bowyer&#8217;s metaphor, is 100% vitamins &#8211; 0% replicated parts.</p></blockquote>
<p>The initial self-replicating manufacturing machine, code named Darwin, has so far completely failed to spread virally. People are building this prototype device, but not by using another 3D printer. I want to examine why, and what this means for the future of desktop manufacturing.</p>
<p>First of all, the machine is far from &#8220;self replicating&#8221; from the user&#8217;s point of view. What you get when the existing design &#8220;replicates&#8221; is a set of plastic parts for a new machine. To this must be added metal hardware, integrated circuits and electronics, and stepper motors. Then the whole must be assembled by someone already skilled in making machines (here I must disagree with the Clanking Replicator post to say that assembling a RepRap still seems to me quite a complex undertaking, the sort of thing you wouldn&#8217;t want to attempt without jeweler&#8217;s screwdrivers and a multimeter.) This is hardly a consumer item. To press the evolutionary analogy, its niche is limited to hardcore geeks. This might still allow exponential growth to saturation of that niche, but desktop manufacturing is not going to transform the world until it goes solidly mainstream.</p>
<p>This means a consumer product. The RepRep must be no harder to reproduce and assemble than Ikea furniture; if the directions are longer than a page, you&#8217;re going to lose 95% of your market immediately. Further, the parts that cannot be desktop manufactured must be ridiculously common. You need to be able to get them at the hardware store, even in places where hardware stores are very limited (especially in such places, if we&#8217;re seriously going to consider transforming Africa.) Stepper motors are far too difficult to obtain, even in developed countries.</p>
<p>The RepRap team recognizes this, and is trying hard to make the machine simpler to obtain in at least two ways. First, the second generation system promises to be simpler, with fewer parts, a more robust mechanical design, and easier assembly. Second, they are looking into ways to expand the types of parts that the RepRap can print. Metal film deposition is an obvious way to go, because then the RepRap could print its own wiring and circuit boards. With time, the team hopes to further simplify the replication process.</p>
<p>Except, why not just buy the parts as a kit? Or even fully assembled as an industrial product? Although the exponential replication story is a beautiful solution to the problem of distributing desktop manufacturing technology, do we really care? Modern civilization is already extremely good at getting an object into the hands of absolutely everyone, everywhere. The towns ringing the Sahara desert may not have electricity, but they sure as hell have Coca-Cola, and usually motorcycles too. No one not a geek is going to care about trying to self-replicate a machine until that process is easier (and cheaper) than buying a finished model at Wal-Mart.</p>
<p>Which brings us to the second major problem I see with the RepRap concept: none of it matters at all until the thing is actually useful. This means making thing that people want <em>other than parts for more RepRaps</em>. Under the heading <a href="http://www.reprap.org/bin/view/Main/ItemsMade">What Can It Make?</a> the RepRap website shows us a fly swatter, a pair of child&#8217;s plastic sandals, a coat hook, an iPhone-to-dashboard mounting bracket, a strainer, a plastic ring, various brackets, a couple of gears, and a crappy martini glass. I understand that this is first generation technology, and it <em>is </em>very cool to make this stuff at home, but is there really any non-geek demand for a machine that can make these sorts of objects?</p>
<p style="text-align: center;"><a href="http://jonathanstray.com/wp-content/uploads/2009/07/vik-glass-3-small.jpg"><img class="size-medium wp-image-847 aligncenter" title="vik-glass-3-small" src="http://jonathanstray.com/wp-content/uploads/2009/07/vik-glass-3-small-300x225.jpg" alt="vik-glass-3-small" width="300" height="225" /></a></p>
<p>In my opinion, the RepRap community has so far focussed far too much on the coolness of a self-replicator, and not nearly enough on what it could be good for. For a research project, this is fine. However, the RepRap will never achieve viral status among the general public unless it&#8217;s <em>actually useful </em>to people who don&#8217;t care about technology. Which, for the purposes of this discussion, is everyone.</p>
<p>Now, 3D printers are just the thing for low-volume runs and prototypes, which is why such machines have traditionally fallen under the category of  &#8221;rapid prototyping.&#8221; It&#8217;s a great technology, and I am very glad to see someone attempting to bring the price down. But it will not be a widely adopted consumer technology until it&#8217;s a better way to get stuff than going to the store or ordering it online. This means designing lots of useful things that are cheaper or easier to manufacture on the desktop than they are to obtain through the usual channels. Unfortunately, those places where consumer object distribution is most limited (it&#8217;s hard to get a martini glass in the Sahara) are exactly those places where it would be hardest to get parts for a home-built desktop manufacturing machine. Put bluntly, the RepRap team does not yet have a product that someone wants.</p>
<p>I have in mind a detailed analysis of common consumer items and their availability. Small plastic widgets are manufactured by the millions in China, and so are of interest only if specialized and currently difficult to obtain. But this is the market niche that high-end rapid prototyping machines already occupy, so no go. We won&#8217;t be desktop-manufacturing plastic spoons any time soon. More promising are assemblies of several parts, such as toys and small machines. Maybe spoons aren&#8217;t interesting, but I wonder if an entire suite of kitchen utensils on demand would be, including egg beater blades and corkscrews, or an entire set of model cars lovingly 3D modeled by online enthusiasts. After that, we rapidly get into objects that a RepRap cannot hope to produce. No one will be manufacturing their own light bulbs or microwaves any time soon.</p>
<p>Except that asking about the manufacture of existing objects is a little deceptive. Current consumer goods have been designed to optimize the cost per unit when thousands or millions are produced in a single factory. Desktop manufacturing imposes a different economics: the relevant parameters are raw materials cost, printing time, manual assembly time, and of course the requirement for parts that cannot be fabricated on the current generation of printers. An analysis of what could be made on the desktop must also be an analysis of how existing classes of objects could be redesigned to be amenable to desktop manufacturing. It&#8217;s not just about building a printer, but about redesigning the entire manufacturing supply chain and inventing entirely new fabrication methods. Want to reduce the external parts count? Perhaps new types of printable plastic fasteners can replace screws. Is the thing too complex for the user to assemble after printing? Integrated <a href="http://en.wikipedia.org/wiki/Monocoque">monocoque</a> designs might be the answer. As far as I know, no one has yet tried to design a monocoque toaster.</p>
<p>This sort of research will also answer questions about what capabilities are most sorely missing in the current generation of printers. The ability to fabricate electronics is obviously desirable, and metal film deposition seems like a good first step. But is it really? The key criterion when evaluating any new capability must be how many useful objects could be fabricated. Asking what fraction of the parts in a RepRap could be made by a RepRap is only interesting when desktop manufacturing technology starts to become competitive with standard manufacturing techniques for complex electro-mechanical objects.</p>
<p>Which brings us to cost. I love the idea of downloading the design for whatever I want, but I still don&#8217;t think I&#8217;m going to print the vast majority of things I need. For plastic spoons, an injection moulding machine plus international shipping is going to be cheaper than desktop manufacturing for a long, long time. This gives desktop manufacturing a potential advantage for complex objects or short runs, especially if the labor of assembly can be avoided by automated production. Unfortunately, complex objects and minimal manual assembly are precisely what current desktop manufacturing technology is worst at. I can imagine a very advanced machine that can make its own integrated circuits and assemble them too, perhaps with its own little robot arm that is controlled by the downloaded manufacturing program. Awesome, and one day potentially a cheaper way to get one&#8217;s hands on that hot new laptop design, but that capability is a very long way away. Instead, the desktop manufacturing research program needs to ask itself what sorts of objects are not only useful and possible in the near term, but expensive to manufacture or distribute by conventional techniques. As an analogy, anyone can now print a book on their laser printer, yet we still buy bulk-printed books.</p>
<p>Finally, it seems to me that the RepRap team is trying to solve two problems at once, and is unclear about their separation:</p>
<ol>
<li>Developing a useful desktop manufacturing capability</li>
<li>Getting that capability into the hands of everyone</li>
</ol>
<p>The self-replicating exponential growth ecological analogy is a beautiful conceptual solution to #2, and it is also a useful technological driver for #1 because a 3d printer is a pretty complex thing to print. The RepRap team is also proud to be distributed, open-source, etc. and this is an admirable approach to #1.  However, these are far from the only solutions to these two problems.</p>
<p>Starting with the distribution problem, a moment&#8217;s thought reveals that it&#8217;s already extremely well solved! Getting a physical object to whoever wants it wherever they are is all but trivial at this point in history. We do already this with everything from soft drinks to mobile phones (and believe me, <em>everybody</em> has both.) The problem is not getting the object to people, it&#8217;s making it as cheap as possible to do so. This means that driving down the cost of desktop manufacture is the key goal; self-replication is only interesting if it&#8217;s cheaper than assembling it in China and shipping it.</p>
<p>As for the technology development problem, there are lots of approaches other than distributed and open source. Many basic technologies have come out of government research programs (such as jet engines and the internet) and private enterprise is of course reknowned for efficiently producing and distributing innovation. So rather than distributed global self-replication, how about this plan: figure out a way that someone can make money off of the idea, at least for a little while. Do the research discussed above and write a business plan. Take the core RepRap team and add to them the best mechanical engineers, manufacturing specialists, and consumer product designers that the world has to offer. Take the plan and the team to your favorite venture capital firm, and ask for a few hundred million dollars. I&#8217;m willing to bet that a well-funded team of crack personnel could solve the daunting technical problems of useful desktop replication much faster than the current distributed organization, and the debt to the VC would provide a strong incentive to build something that people actually wanted. This doesn&#8217;t immediately imply monopoly: open-source the design if you like, and make money off being the first to get there. There is no shortage of potential ethical business plans.</p>
<p>If the goal is to develop desktop replication technology to a useful state and get it into the hands of as many people as possible as quickly as possible, then I am not at all sure that self-replication is a useful near-term design goal. I do love the idea, and I believe that, eventually, self-replication will become a useful manufacturing strategy. I also really like the upgrade bootstrapping concept, where each new generation of manufacturing machine is designed to be fabricated on the last. This is an approach that will allow advances in manufacturing technology to spread at the speed of information. But we&#8217;re not there yet. Those working on desktop manufacturing technology today will need to concentrate on cost and usefulness, probably for many years to come.</p>
]]></content:encoded>
			<wfw:commentRss>http://jonathanstray.com/self-replicating-desktop-manufacturing/feed</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>The Glissendo Performance, and Machine-Assisted Circus</title>
		<link>http://jonathanstray.com/the-glissendo-performance-and-machine-assisted-circus</link>
		<comments>http://jonathanstray.com/the-glissendo-performance-and-machine-assisted-circus#comments</comments>
		<pubDate>Tue, 16 Jun 2009 02:42:44 +0000</pubDate>
		<dc:creator>Jonathan Stray</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[art]]></category>
		<category><![CDATA[technology]]></category>

		<guid isPermaLink="false">http://jonathanstray.com/?p=758</guid>
		<description><![CDATA[I was recently pointed to the most amazing thing, a music / fire / street performance called Glissendo, conceived by one &#8220;Ulik, the Machanical Clown&#8221; and executed by French art group Le Snob. They&#8217;re playing &#8220;Lightning&#8221; by Phillip Glass on a Dixieland band, riding Segways under the robes, and of course the band leader has [...]]]></description>
			<content:encoded><![CDATA[<p>I was recently pointed to the most amazing thing, a music / fire / street performance called Glissendo, conceived by one &#8220;Ulik, the Machanical Clown&#8221; and executed by French art group <a href="http://www.fanfarelesnob.com/index2.htm">Le Snob</a>.  They&#8217;re playing &#8220;Lightning&#8221; by Phillip Glass on a Dixieland band, riding Segways under the robes, and of course the band leader has dual hand-mounted flamethrowers.</p>
<p style="text-align: center;"><object width="425" height="344" data="http://www.youtube.com/v/KYykpRRuHQM&amp;hl=en&amp;fs=1&amp;" type="application/x-shockwave-flash"><param name="allowFullScreen" value="true" /><param name="allowscriptaccess" value="always" /><param name="src" value="http://www.youtube.com/v/KYykpRRuHQM&amp;hl=en&amp;fs=1&amp;" /><param name="allowfullscreen" value="true" /></object></p>
<p style="text-align: left;">Elegant, beautiful, and strangely sad.</p>
<p style="text-align: left;">The only substantial thing I can find on this Ulik character is <a href="http://www.youtube.com/watch?v=tvL5DOu3G2">this video</a>. In it, Ulik performs with some of his contraptions such as a home-made jet-engine backpack (used with skis or rollerblades), a life-sized puppet who holds a camera and interviews him, and the front half of a car. It&#8217;s all wonderfully creative stuff, and it makes me wonder why we haven&#8217;t seen more hi-tech in circus.</p>
<p style="text-align: left;">For the potential is ample. We could use modern <a title="example: the Segway balancing control algorithm " href="http://cgi.cse.unsw.edu.au/~cs4411/wiki/index.php?title=Segway_Control_Algorithms">control-system</a> technology to perform previously impossible man-machine feats of daring. I wonder about automatically balancing Segways 30 feet high than one could dance on top of,  harnesses connected to a crane that cancels out its own friction and inertia and modulates the effective gravity under performer control, a ridiculously precise robotic juggling partner, or powered jumping stilts with built in balance and timing systems. This is not mere robotic circus; at their best, such machines become something between costume and vehicle, an extension of the performer&#8217;s body that makes them, taller, stronger, faster, or able to move excitingly inhuman ways.</p>
<p style="text-align: left;">Given that such wide artistic and technological possibilities exist, I find it hard to believe that they won&#8217;t be developed. We may currently be witnessing the last generation of aerial circus that does not make heavy use of technology.</p>
]]></content:encoded>
			<wfw:commentRss>http://jonathanstray.com/the-glissendo-performance-and-machine-assisted-circus/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>FMRI &#8220;Mind Reading&#8221; Doesn&#8217;t Yet Threaten Humanity</title>
		<link>http://jonathanstray.com/fmri-mind-reading-doesnt-yet-threaten-humanity</link>
		<comments>http://jonathanstray.com/fmri-mind-reading-doesnt-yet-threaten-humanity#comments</comments>
		<pubDate>Mon, 09 Mar 2009 04:49:05 +0000</pubDate>
		<dc:creator>Jonathan Stray</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[AI]]></category>
		<category><![CDATA[fmri]]></category>
		<category><![CDATA[minds]]></category>
		<category><![CDATA[science]]></category>
		<category><![CDATA[science fiction]]></category>
		<category><![CDATA[technology]]></category>

		<guid isPermaLink="false">http://jonathanstray.com/?p=451</guid>
		<description><![CDATA[It is now possible to see what a person is looking at by scanning their brain. The technique, published last November by a team of Japanese neuroscientists, uses FMRI to reconstruct a digital image of the picture entering the eye, albeit at very low resolution and only after hundreds of training runs. Still, it&#8217;s an [...]]]></description>
			<content:encoded><![CDATA[<p style="text-align: center;"><a href="http://jonathanstray.com/wp-content/uploads/2009/03/visual-image-reconstruction-from-fmri.png"><img class="size-medium wp-image-453 aligncenter" title="visual-image-reconstruction-from-fmri" src="http://jonathanstray.com/wp-content/uploads/2009/03/visual-image-reconstruction-from-fmri-300x231.png" alt="visual-image-reconstruction-from-fmri" width="300" height="231" /></a></p>
<p>It is now possible to see what a person is looking at by scanning their brain. The technique, published last November by a team of Japanese neuroscientists, uses <a title="FMRI is neat technology " href="http://en.wikipedia.org/wiki/Fmri">FMRI</a> to reconstruct a digital image of the picture entering the eye, albeit at very low resolution and only after hundreds of training runs. Still, it&#8217;s an awesome development, and many articles covering this research have called it &#8220;mind reading&#8221; (<a href="http://scienceblogs.com/pharyngula/2008/12/soon_well_be_reading_your_mind.php">1</a>, <a href="http://arstechnica.com/old/content/2008/12/mindreading-101-identifying-images-by-watching-the-brain.ars">2</a>, <a href="http://www.thenakedscientists.com/HTML/content/news/news/1568/">3</a>, <a href="http://nottotallyrad.blogspot.com/2008/12/mri-mind-reading-imaging.html">4</a>, <a href="http://www.pinktentacle.com/2008/12/scientists-extract-images-directly-from-brain/">5</a>). But it really isn&#8217;t, and it&#8217;s fun to explore what real &#8220;mind reading&#8221; would imply.</p>
<p>When I hear &#8220;mind reading&#8221; I want psychic abilities. I want to be able to know what number you&#8217;re thinking of, where you were on the night of March 4th, and what you actually think of my souffle. This is the sort of technology that could be badly misused, as the <a title="hysteria!" href="http://www.pinktentacle.com/2008/12/scientists-extract-images-directly-from-brain/">comments</a> on one blog note:</p>
<blockquote><p>Am I the only one finding this DEEPLY disturbing? It opens the doors to some of the scariest 1984-style total-control future predictions. Imagine you can’t hide your f#&amp;%!ng MIND!</p></blockquote>
<p>Fortunately, we&#8217;re not there yet. Morover, if we did have the technology to read minds, we&#8217;d have much bigger societal issues than privacy to deal with. The existence of &#8220;mind reading machines&#8221; would imply that we possessed good formal models of the human mind, and <em>that</em> is a can of worms.</p>
<p><span id="more-451"></span>But back to today. The <a href="http://jonathanstray.com/papers/Visual%20Image%20Reconstruction%20from%20Human%20Brain%20Activity%20using%20a%20Combination%20of%20Multiscale%20Local%20Image%20Decoders.pdf">paper</a> by Yoichi Miyawaki and colleagues describes a technique for exploiting <a title="cool word!" href="http://en.wikipedia.org/wiki/Retinotopy">retinotopy</a>, the fact that certain areas of the visual cortex are direct &#8220;maps&#8221; of the retina. First, a series of 10&#215;10 black and white test images are shown to a someone while their neural activation is recorded by FMRI. The responses to these test images are used to ascertain which areas of the visual cortex correspond to which areas of the subject&#8217;s field of vision. When the neural map is complete, it can be read &#8220;backwards,&#8221; going from neural scanner results to a low resolution representation of whatever the subject is currently looking at.</p>
<p>This is a long way from a tool for the thought-police. First, the algorithm requires training on each new person. Also, an MRI machine is a huge, expensive, complicated piece of machinery which requires the subject to stay very still over a period of minutes &#8212; widespread brain scanning is, for the moment, completely out of the question. But most fundamentally, the information recovered is nothing more than what the eye is currently looking at. You might as well just tape a digital camera to the subject&#8217;s head. The pictures would be a lot better.</p>
<p>What is it that we imagine for a mind reading machine? Perhaps a printout, in words, of every thought that goes through someone&#8217;s mind. But <a title="it's an old question" href="http://en.wikipedia.org/wiki/Sapir_Worf_Hypothesis">do people really think exclusively in words</a>? What about their emotions, or their unconscious responses, or even the complete set of minor joint aches and temperature sensations all over their body? Or how about a video playback of the events of yesterday evening? Impossible, because <a href="http://health.howstuffworks.com/human-memory.htm/printable">that&#8217;s not how human memory works</a>. When we think about it carefully, we realize that we have an extremely poor conception of what is actually &#8220;in someone&#8217;s head.&#8221;</p>
<p>Compounding this problem is the fact we can&#8217;t even say what&#8217;s in our <em>own</em> heads. <a title="Minds are trick things" href="http://jonathanstray.com/minds-are-tricky-things-part-iii">We think we can, but we can&#8217;t.</a> Decades of psychological experiments show that access to the contents of our own minds and the working of our own thought processes is very limited. Consequently, we cannot answer the question &#8220;what would a mind-reader read?&#8221; through introspection.</p>
<p>This is why, before we could build a mind-reading machine, we would first need formal models of a &#8220;mind.&#8221; We need the sort of mathematical models that one can manipulate with a computer, because computers will surely be intensely involved in any mind reading technology. If recent developments in linguistics and artificial intelligence research are any guide, these models will be huge, associative, and statistical in nature, nothing like the structured logic we think we possess. For example, Google translates web pages between different languages <a title="Like nearly all machine translation, lots of data, little code" href="http://www.technologyreview.com/business/17793/?a=f">without using anything like formal grammar</a> models.</p>
<p>In other words, we cannot &#8220;read minds&#8221; because we have very little idea of how minds might be stored on a computer. This problem is known in AI as &#8220;<a href="http://en.wikipedia.org/wiki/Knowledge_representation">knowledge representation</a>,&#8221; and we still know very little about it.</p>
<p>Good formal models of the mind, if possible, are the technological precursor to entire fields of information engineering, and this is why I&#8217;m not worried about mind-reading technology per se. We&#8217;ll get beneficial things like accurate machine translation and computers that respond to voice queries &#8212; no more fighting with software that just doesn&#8217;t understand what you want. (Think also of the possibilities for art and expression.) We&#8217;ll also get uncomfortable technologies like sickeningly effective advertisements that exploit behavioral quirks we didn&#8217;t know we had, and NSA-funded conversation snooping programs that make existing keyword scanners look like the toys that they are. Finally, it would be possible to use accurate human mind models for pure evil: imagine a computer virus that was designed to read your personal files and figure out how best to convince you that the Dictator was beneficent. All of this may sound very far-fetched, but we&#8217;re going to build these things if we possibly can: think of how much money Google makes from each percentage point of improvement in ad clickthroughs.</p>
<p>If the Japanese FMRI technique seems positively simplistic in this light, that&#8217;s because it is. They have read retinas, not minds. They are extracting a representation we already have abundant experience with: images. Saying that we&#8217;ve made a step towards reading minds is ridiculous; Thomas Edison might just as well have claimed to &#8220;record thoughts&#8221; when he announced the phonograph.</p>
<p>I bother with all of this both because I think science journalism is often done badly, and because I believe that it&#8217;s important to get hysterical about the right thiings. One comment posted to a <a title="kinda neat" href="http://www.youtube.com/watch?v=daY7uO0eftA">video of the research</a> reads, &#8220;this is the beginning of the end of free thought.&#8221; Perhaps the continuation of this type of FMRI research really will one day lead to the ability to determine what someone is thinking without invoking their consent, but torture already does that. To me, the ability to represent someone&#8217;s thoughts in electronic form has far greater implications than mind-reading per se, and this sort of FMRI research &#8212; as impressive as it is &#8212; contributes little to that enterprise.</p>
]]></content:encoded>
			<wfw:commentRss>http://jonathanstray.com/fmri-mind-reading-doesnt-yet-threaten-humanity/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>How Many World Wide Webs Are There?</title>
		<link>http://jonathanstray.com/how-many-webs</link>
		<comments>http://jonathanstray.com/how-many-webs#comments</comments>
		<pubDate>Wed, 04 Feb 2009 23:53:51 +0000</pubDate>
		<dc:creator>Jonathan Stray</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[belief]]></category>
		<category><![CDATA[information]]></category>
		<category><![CDATA[information visualization]]></category>
		<category><![CDATA[language]]></category>
		<category><![CDATA[technology]]></category>
		<category><![CDATA[translation]]></category>

		<guid isPermaLink="false">http://jonathanstray.com/?p=257</guid>
		<description><![CDATA[How much overlap is there between the web in different languages, and what sites act as gateways for information between them? Many people have constructed partial maps of the web (such as the  blogosphere map by Matthew Hurst, above) but as far as I know, the entire web has never been systematically mapped in terms [...]]]></description>
			<content:encoded><![CDATA[<p style="text-align: center;"><a href="http://datamining.typepad.com/gallery/blog-map-gallery.html"></a><a href="http://datamining.typepad.com/gallery/blog-map-gallery.html"><img class="alignnone size-medium wp-image-328" title="newblog-crop" src="http://jonathanstray.com/wp-content/uploads/2009/02/newblog-crop-300x274.png" alt="newblog-crop" width="300" height="274" /></a></p>
<p>How much overlap is there between the web in different languages, and what sites act as gateways for information between them? Many people have constructed partial maps of the web (such as the  <a href="http://datamining.typepad.com/gallery/blog-map-gallery.html">blogosphere map</a> by Matthew Hurst, above) but as far as I know, the entire web has never been systematically mapped in terms of language.</p>
<p>Of course, what I actually want to know is, how connected are the different cultures of the world, really? We live in an age where the world seems small, and in a strictly technological sense it is. I have at my command this very instant not one but several enormous international communications networks; I could email, IM, text message, or call someone in any country in the world. And yet I very rarely do.</p>
<p>Similarly, it&#8217;s easy to feel like we&#8217;re surrounded by all the international information we could possibly want, including direct access to foreign news services, but I can only read articles and watch reports in English. As a result, information is firewalled between cultures; there are questions that could very easily be answered by any one of tens or hundreds of millions of native speakers, yet are very difficult for me to answer personally. For example, what is the journalistic slant of <a href="http://www.aljazeera.net/portal">al-Jazeera</a>, the original one in Arabic, not the <a href="http://www.aljazeera.com/index.html">English version</a> which is produced by a completely different staff?  Or, suppose I wanted to know what the average citizen of Indonesia thinks of the sweatshops there, or what is on the front page of the Shanghai Times today&#8211; and does such a newspaper even exist? What is written on the 70% of web pages that are not in English?</p>
<p><span id="more-257"></span>We all live on the same physical planet, but the information worlds we inhabit must be vastly different. This are many reasons for this other than language, but language alone is enough to isolate humanity from itself.</p>
<p>And so, my question: how many islands are there in our multi-cultural information space, and how are they connected? I am willing to bet that a full-scale web map would show several large networks in the <a title="What languages are web pages in?" href="http://www.internetworldstats.com/stats7.htm">main languages of the web</a> &#8212; English, Chinese, Spanish, Japanese, German, etc. &#8212; but few connections between them, web sites frequented by bilingual or bi-cultural individuals, who after all are the true gateways between cultures. The structure of the interconnections might tell us something about the relationships between cultures, and the actual number of links might provide some measure of how close or how far apart we actually are. The individual URLs themselves would also be extremely valuable information, representing high-bandwidth links between cultures, the trans-occeanic fiber between continents in the infosphere.</p>
<p>There is a second geography to the world that we&#8217;ve never seen. I don&#8217;t even know what I&#8217;m missing.</p>
<p>Creating such a map would be a trick, but by no means out of the reach of an academic project or a small company. <a href="http://googleblog.blogspot.com/2008/07/we-knew-web-was-big.html">Google</a> says there are currently over one trillion (10^12) unique web pages (for their particular definition of &#8220;unique&#8221;, which is more complex than it might seem.) Unlike a search engine, a language-based web map does not require the full contents of every page, merely the outgoing URLs and a discrete categorization of the language (which can be <a title="TextCat, mmrow!" href="http://odur.let.rug.nl/~vannoord/TextCat/">automatically determined</a> even without any document meta-data.)  Assuming that each URL  is assigned a unique 32 bit ID, another 32 bits for language and other info, and then links to an average of 20 other pages (<a href="http://uclue.com/index.php?xq=1015">estimates vary</a>), this is about 100 terrabytes of data &#8212; or perhaps $15000 worth of storage at current prices. This index could be created from a fresh crawl, or by parsing an existing one, such as from the folks at the brand new and very awesome <a title="Sooo Awesome!" href="http://www.dotnetdotcom.org/">DotBot open index of the web</a>.</p>
<p>The next step would be to generate the visualization of such a massive data set. The complete graph could be laid out in two or three dimensions using existing <a href="http://www.informatik.tu-cottbus.de/~an/GD/">clustering methods</a>. The resulting map could be traversed using GPU-accelerated rendering techniques for very large data sets, probably after some sort of hierarchical pre-processing that produces proxies for zoomed-out views of the network. A usuable UI would be crucial; the entire map needs to be navigable at multiple scales and composed of live, hyperlinked objects. The right visualization also depends on what you are trying to discover;  ultimately, there can be no single map because the choice of visualization is dependent upon usability and aesthetics, as the huge variety of beautiful maps at <a href="http://www.visualcomplexity.com/vc/">Visual Complexity</a> demonstrate.</p>
<p>The analysis could go much deeper with more computing power. Machine translation is currently poor, but it is probably good enough to detect whether one document is a translation of another. With this capability, we would actually be able to quantify the percentage of (public) textual information that makes it from one language into another and identify the key organizations that act as conduits. Further study might reveal fascinating things, such as selection biases in the types of news or information that get translated. The implications for differences in belief between cultures are obvious.</p>
<p>Yet even  a &#8220;links only&#8221; data set could still answer some highly revealing questions,  such as &#8220;what percentage of web sites are visited by people from multiple cultures?&#8221; or even &#8220;what is the best gateway between Polish and English film reviews?&#8221; This could be done without visualization, but it would be a mistake not to draw the actual maps.  Not only do pictures engage our spatial reasoning in a way that raw bits never can, but such a map would re-make an obvious point that is too often lost: in terms of communication between cultures, the world is not nearly as small or interconnected as we&#8217;d like to think it is.</p>
]]></content:encoded>
			<wfw:commentRss>http://jonathanstray.com/how-many-webs/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>What Internet Censorship Looks Like, Part 2</title>
		<link>http://jonathanstray.com/what-internet-censorship-looks-like-part-2</link>
		<comments>http://jonathanstray.com/what-internet-censorship-looks-like-part-2#comments</comments>
		<pubDate>Sun, 11 Jan 2009 12:32:28 +0000</pubDate>
		<dc:creator>Jonathan Stray</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[censorship]]></category>
		<category><![CDATA[developing world]]></category>
		<category><![CDATA[internet]]></category>
		<category><![CDATA[internet filtering]]></category>
		<category><![CDATA[politics]]></category>
		<category><![CDATA[technology]]></category>
		<category><![CDATA[turkey]]></category>

		<guid isPermaLink="false">http://jonathanstray.com/?p=173</guid>
		<description><![CDATA[The Turkish Government censors internet access from within the country, as I discovered yesterday when attempting to access YouTube from the Turkish town of Selçuk, as this screenshot shows (click to enlarge): The English text on this page reads: &#8220;Access to this web site is banned by &#8216;TELEKOMÜNİKASYON İLETİŞİM BAŞKANLIĞI&#8217; according to the order of: [...]]]></description>
			<content:encoded><![CDATA[<p>The Turkish Government censors internet access from within the country, as I discovered yesterday when attempting to access YouTube from the Turkish town of Selçuk, as this screenshot shows (click to enlarge):</p>
<p style="text-align: center;"><a href="http://jonathanstray.com/wp-content/uploads/2009/01/web-censorship-in-turkey.png"><img class="size-medium wp-image-174 aligncenter" title="web-censorship-in-turkey" src="http://jonathanstray.com/wp-content/uploads/2009/01/web-censorship-in-turkey-300x177.png" alt="web-censorship-in-turkey" width="300" height="177" /></a></p>
<p>The English text on this page reads: &#8220;<span class="yazi3">Access to this web site is banned by &#8216;TELEKOMÜNİKASYON İLETİŞİM BAŞKANLIĞI&#8217; according to the order of: Ankara 1. Sulh Ceza Mahkemesi,</span><span class="yazi3_1"> 05.05.2008 of 2008/402&#8243;</span></p>
<p>Just to complete the irony, I was looking for a video of the <a href="http://en.wikipedia.org/wiki/Oscar_Grant">Oscar Grant shooting</a> when I first discovered this &#8220;blocked site&#8221; page.</p>
<p><span id="more-173"></span>I have previously reported on <a href=" http://jonathanstray.com/what-does-internet-censorship-look-like  ">internet censorship in the United Arab Emirates</a>. Turkey&#8217;s &#8220;you can&#8217;t see this&#8221; page is not nearly as flashy, and the censorship may be less severe: I can reach Flickr from here, for example. However, it is not possible to read the website of <a href="http://richarddawkins.net">Richard Dawkins</a> in Turkey; there even appears to be a more specific (and forthright?) banner page.</p>
<p style="text-align: center;"><a href="http://jonathanstray.com/wp-content/uploads/2009/01/richard-dawkins-is-censored-in-turkey.png"><img class="size-medium wp-image-175 aligncenter" title="richard-dawkins-is-censored-in-turkey" src="http://jonathanstray.com/wp-content/uploads/2009/01/richard-dawkins-is-censored-in-turkey-300x177.png" alt="richard-dawkins-is-censored-in-turkey" width="300" height="177" /></a></p>
<p>(Sadly, Google Translate does not support Turkish &#8212; dear lazywebs, can anyone out there give an exact translation?)</p>
<p>This suggests that Turkey&#8217;s censorship attempts &#8212; all of which can  be easily circumvented with tools like <a href="http://torproject.org">Tor</a> &#8212; are more concerned with social and religious mores of various sorts, as opposed to the efforts of countries like China where there is a clear political motive underlying the censorship pattern (for example, the <a href="http://en.wikipedia.org/wiki/Tiananmen_Square_protests_of_1989">Tiananmen Square killings</a> never happened, according to <a href="http://news.bbc.co.uk/1/hi/technology/4645596.stm">Google China</a>.)</p>
<p>For more, please see the fabulous <a href="http://opennet.net">Open Net Initiative</a>, which tracks and reports on internet censorship worldwide, and has an excellent <a href="http://opennet.net/node/988">review article</a> on the Turkish situation. Unsurprisingly, Turkey also has had some recent problems with <a href="http://en.wikipedia.org/wiki/List_of_prosecuted_Turkish_writers">freedom of expression</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://jonathanstray.com/what-internet-censorship-looks-like-part-2/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>From Electronic to Cryptographic Voting</title>
		<link>http://jonathanstray.com/from-electronic-to-cryptographic-voting</link>
		<comments>http://jonathanstray.com/from-electronic-to-cryptographic-voting#comments</comments>
		<pubDate>Wed, 07 Jan 2009 12:52:16 +0000</pubDate>
		<dc:creator>Jonathan Stray</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[cryptography]]></category>
		<category><![CDATA[democracy]]></category>
		<category><![CDATA[elections]]></category>
		<category><![CDATA[politics]]></category>
		<category><![CDATA[technology]]></category>
		<category><![CDATA[voting]]></category>

		<guid isPermaLink="false">http://jonathanstray.com/?p=170</guid>
		<description><![CDATA[Electronic voting machines were supposed to make elections cheaper, faster, and more secure, but so far they have failed. In the last decade there has been something of a rush to adopt  e-voting, followed by suspicion and controversy over the black-box, &#8220;just trust us&#8221; nature of the first generation of commercial systems, followed by a [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://en.wikipedia.org/wiki/Electronic_voting">Electronic voting</a> machines were supposed to make elections cheaper, faster, and more secure, but so far they have failed. In the last decade there has been something of a rush to adopt  e-voting, followed by suspicion and <a href="http://www.schneier.com/blog/archives/2006/11/voting_technolo.html">controversy</a> over the black-box, &#8220;just trust us&#8221; nature of the first generation of commercial systems, followed by a <a href="http://www.washingtonpost.com/wp-dyn/content/story/2008/10/29/ST2008102904129.html">return to paper ballots</a> in many jurisdictions. However, if we wish to improve election processes, cheap and fast is probably the wrong goal. It may be possible to use cryptographic techniques to implement <a href="http://en.wikipedia.org/wiki/End-to-end_auditable_voting_systems">end-to-end auditable</a> elections, new in human history.</p>
<p>The e-voting fiasco has illustrated that paper ballots are a better system than they might at first seem. Paper preserves voter secrecy, it is auditable after the fact, and it is even reasonably transparent, if one also allows election observers. But paper ballots must be closely guarded and cannot be directly counted by members of the general public, who in the end have no choice but to trust election officials,  observers, counting equipment, and the entire <a href="http://en.wikipedia.org/wiki/Chain_of_custody">chain of custody</a>. Rather than simply duplicating paper ballots electronically, we should strive to improve upon them.</p>
<p>This seems to be possible. Modern cryptography suggests the possibility of a new kind of incredibly transparent and fair election, where ordinary citizens can verify the soundness of the election for themselves, without ever needing to trust blindly that a huge array of machines and people have acted correctly. This represents a fundamentally new ability: for the first time, it may be possible to hold truly &#8220;open&#8221; elections.</p>
<p><span id="more-170"></span><strong>What are we trying to accomplish?</strong></p>
<p>Ideally, a democratic voting system would satisfy the following criteria:</p>
<ul>
<li>Secrecy: to prevent coercion or vote-buying, each person&#8217;s vote must be secret in perpetuity.</li>
<li>Transparency: all voting procedures must be public and understandable by everyone.</li>
<li>Verifiability: it must be possible to independently audit or validate the election results.</li>
<li>Usability: it must be easy to vote, and cheap to deploy the system for hundreds of millions of voters.</li>
</ul>
<p>In this way, each person would vote freely, while the entire society could have confidence in the outcome. The difficultly with these criteria is that they conflict: it is hard to preserve both secrecy and verifiability in a simple, transparent way.</p>
<p>Paper ballots fall short of the these ideals in many ways. They are nicely secret, and the process is reasonably transparent as there are public laws describing exactly how the votes are to be tallied, regulations providing for election observers, etc. However, independent audits are not really possible, because they require access to a large quantity of fragile and politically sensitive paper. In principle, we would like it to be possible for any regular citizen with sufficient time on their hands to perform a complete audit of the election results.</p>
<p>Elections results could be openly verified by publishing copies of every ballot cast, but only if there was some way to ensure that these copies were accurate. This could be done by issuing to each citizen to some sort of receipt of their vote which could be checked against the public list, but then votes would not be secret: they could be coerced or bought by offering a clandestine cash reward for receipts.</p>
<p>While paper ballots leave much to be desired, current electronic voting systems are worse. All e-voting machines are essentially &#8220;black boxes&#8221; that transform the voter&#8217;s choice into a final tally by some complex and unknown process. This makes them completely non-transparent. In the worst case, paperless <a href="http://en.wikipedia.org/wiki/DRE_voting_machine">direct recording electronic</a> (DRE) voting machines are not at all verifiable, which makes them subject to both invisible malfunction and deliberate hacking  (either <a href="http://www.openvotingfoundation.org/tiki-read_article.php?articleId=1">in the voting booth</a> or <a href="http://www.foxnews.com/story/0,2933,133214,00.html">at the tally station</a>.) There has never been a convincingly documented case of miscount or fraud with DRE machines, but that may only be because such machines leave absolutely no record of the election process!</p>
<p>Because of this, many American states <a href="http://select.nytimes.com/2007/01/10/opinion/11talkingpoints.html?_r=1">now require a paper record</a> even for otherwise electronic machines, but even paper audit trails are problematic: when is an audit performed? Will all ballots be routinely audited or just a sample? What is <a href="http://www.votetrustusa.org/pdfs/VTTF/EVEPAuditing.pdf">the right sample size for confidence</a> in the results? What happens if a discrepancy is discovered? Meanwhile, other states have <a href="http://www.washingtonpost.com/wp-dyn/content/story/2008/10/29/ST2008102904129.html">gone back to paper ballots entirely</a> and a number of electronic voting machines have been <a href="http://www.theregister.co.uk/2004/05/03/cal_decertifies_diebold/">de-certified</a>.</p>
<p><strong>Cryptographic Hope</strong></p>
<p>Enter cryptography, the discipline that has brought us such miracles as secure communication between two parties who have never exchanged any information in secret (<a title="it's still amazing to me the public-key cryptography is possible" href="http://en.wikipedia.org/wiki/Public-key_cryptography">public-key cryptography</a>), tamper-proof  electronic documents (<a href="http://en.wikipedia.org/wiki/Digital_signature">digital signatures</a>), and the ability to prove that one knows a secret without giving it away (<a title="zero-knowledge proofs are magic" href="http://en.wikipedia.org/wiki/Zero_knowledge_proof">zero-knowledge proofs</a>.) In the wake of these achievements, there has been some hope that proper cryptographic protocols will simultaneously solve the secrecy, transparency, and verifiability issues.</p>
<p>Voting might still be electronic in a cryptographic system, but the security of an election would rest on open cryptographic protocols rather than on trusted system implementations, the physical security of ballot boxes, or the honesty of certain people. Even better, the election results would be auditable at any time from public information, and each voter could verify that their own ballot was correctly recorded yet their personal vote would remain secret and unprovable.</p>
<p>Such a system is said to be <a href="http://en.wikipedia.org/wiki/End-to-end_auditable_voting_systems">end-to-end auditable</a>, and represents a fundamental shift: for the first time, it may be possible to hold completely &#8220;open&#8221; elections in the sense that governments and election officials would have no more authority or power than ordinary citizens. This is unprecedented in human history, and it is exciting.</p>
<p>It is also quite a trick, and has never been demonstrated in practice. Aside from secrecy, transparency, and verifiability, any proposed cryptographic voting system must guard against many different kinds of attacks. These include tampering and &#8220;denial of service&#8221; attacks against the election, such as the ability to spoil the election through some sort of interference (as might suit an opposition group) or to arbitrarily declare that it was spoiled in some non-disprovable way (as a dictator might wish to do.)</p>
<p><strong>A Toy Example: ThreeBallot</strong></p>
<p>Like much of modern cryptography, the simultaneous provision of both secrecy and verifiability seems counter-intuitive. To aid in the study and conceptualization of such systems, professor <a href="http://en.wikipedia.org/wiki/Ron_Rivest">Ron Rivest</a> of MIT (the &#8220;R&#8221; of <a href="http://en.wikipedia.org/wiki/RSA">RSA</a> fame) invented a &#8220;toy&#8221; voting system in 2006 called <a href="http://en.wikipedia.org/wiki/ThreeBallot">ThreeBallot</a>.</p>
<p>It works like this: each voter is given three identical ballots in the voting booth. To vote <em>for</em> a candidate, the voter writes a mark on <em>two</em> randomly chosen ballots; to vote <em>against </em>a candidate, only one randomly chosen ballot is marked.</p>
<p>A valid vote is one in which each candidate is marked on either one (against) or two (for) randomly selected ballots. This could be checked e.g. by an optical scanning machine, much as paper ballots are currently validated at polling stations.</p>
<p>Then the voter secretly chooses one of the ballots and makes a copy of it as a receipt; the others are dropped into the ballot box. Each of the three ballots has a unique serial number.</p>
<p>After the election, all ballots are published publicly, and anyone can tally the election results from these copies. Additionally, each voter can verify that their ballot was published accurately by looking up their receipt in the published list. Yet there is no way to use a receipt to determine who someone voted for, because the voter can arrange to have any particular set of markings on the receipt that they keep. The receipt also prevents tampering, because a would-be tamperer does not know which of the three ballots the voter has retained. Thus there is a 2-in-3 chance of getting away with tampering with (or deleting) any one vote, but only a (2/3)^N chance of getting away with tampering N votes &#8212; like tossing N heads in a row, these are very rapidly shrinking odds.</p>
<p>ThreeBallot was never meant to be a real election system, and in fact in a University class voting experiment ThreeBallot was found to have <a href="http://electionupdates.caltech.edu/?p=761">significant security and usability problems</a>: a third of voters couldn&#8217;t produce a correct set of ballots the first time, and a student &#8220;attacker&#8221; was able to manipulate about 20% of the votes cast, enough to change the election result. He did this in part by clandestinely reading other people&#8217;s receipts, such as those left in voting booths or on desks. This reminds us once again that security is always about much, much more than good cryptography.</p>
<p>However, the basic ideas of ThreeBallot &#8212; randomness in the vote-casting process, voter receipts, published ballots, and probabilistic tampering detection &#8212; are found in virtually all cryptographic voting schemes.</p>
<p><strong>Realistic Proposals<br />
</strong></p>
<p>Serious proposals are somewhat more complex. Some are designed to be entirely electronic while others are additions to paper ballot systems. Major proposals include <a href="http://en.wikipedia.org/wiki/Punchscan">Punchscan</a> (2006) by cryptographer David Chaum, <a href="http://en.wikipedia.org/wiki/Scantegrity">Scantegrity</a> (2007) by David Chaum and Ron Rivest, and <a href="http://eprint.iacr.org/2007/162.pdf">Bingo Voting</a> (2008) by a trio of German researchers. All of these systems are very cryptographically clever, but as always, security in the real world is about much more than cryptography. A <a href="http://naveen.ksastry.com/papers/cryptovoting-usenix05.pdf">2005 paper</a> considered how a real election system employing end-to-end auditable protocols might work, and proposed various non-cryptographic attacks including collection of receipts, social engineering of election workers, and denial-of-service attacks which could invalidate the entire election (such as hacking the voting machines to record spoiled ballots.)</p>
<p>Also, many problems just cannot be solved cryptographically. One major reason why we don&#8217;t have internet voting is that it is impossible to prevent coercion and vote buying if voters can mark their ballots at home. A physical polling booth can at least be secured against witnesses &#8212; though not against, say, someone who will pay for cell-phone camera pictures of a suitably marked ballot. A completely secure voting system is  probably completely impossible.</p>
<p>Nonetheless, there is hope for electronic voting systems, not because they would allow us to vote cheaper or faster or more conveniently, but because they hold the promise of more transparent elections. Would-be designers and implementers of voting systems must realize that the purpose of a voting system is not just to count votes, but to ensure that everyone believes that the process was fair, and to ensure that this fairness can be proved as easily and as widely as possible.</p>
]]></content:encoded>
			<wfw:commentRss>http://jonathanstray.com/from-electronic-to-cryptographic-voting/feed</wfw:commentRss>
		<slash:comments>12</slash:comments>
		</item>
	</channel>
</rss>

