This is a post about how Faster is Different. This is a topic I'll return to frequently, but I want to make a point about it up front. Programmers think about the resources available to them in terms of space/time. This isn't the Physicist's space/time; this is about computers.
Computers process stuff (in time), and they store stuff (in space). So, roughly, processors = time, and memory/disk = space. Processors come at different speeds (operations per second), and with different numbers of cores. More of either gives you more time, so much that you can have
more than 1second per second of time if you have more than one processor. More memory gives you more space.
This is interesting, because we (programmers) measure how expensive a given algorithm is in terms of space/time; and we measure how hard a given problem is in terms of space/time as well. It might seem strange to make the distinction between algorithms and problems, but sometimes you don't use the "best" algorithm to solve a problem. Generally, a problem is defined to be as hard as the absolute best algorithm that will solve it. If you don't have enough space/time to solve a problem, then you can't. This is the basis for cryptography, very hard problems.
Anyway, back to the main point: Faster is Different.
More space/time doesn't just make your existing stuff faster, it sometimes changes a problem with no solution (because it was too expensive) into a problem with a wildly expensive solution. And problems which are wildly expensive today will be very expensive a year from now, quite expensive 3 years from now, and sorta cheap in 5 years. 10 years from now, they'll be free. All thanks to Moore's Law, and its impact on our future supply on space/time, assuming it doesn't stop. Now, there's reasons to question Moore's Law, but that's for another post.
In this installment of Faster is Different, I want to talk to you about
The Swarm. The Swarm is, roughly, everybody
and their computers. The Swarm can do a lot of interesting things, and the Swarm is always watching you. Right now, that doesn't really matter, because it doesn't have a lot of space/time available to it. But that's gonna change.
My employer, Google, just posted some interesting research results in computer vision. I wasn't involved in any way, but its germane to the topic of this post:
Official Google Blog: A new landmark in computer vision
Our research builds on the vast number of images on the web, the ability to search those images, and advances in object recognition and clustering techniques. First, we generated a list of landmarks relying on two sources: 40 million GPS-tagged photos (from Picasa and Panoramio) and online tour guide webpages. Next, we found candidate images for each landmark using these sources and Google Image Search, which we then "pruned" using efficient image matching and unsupervised clustering techniques. Finally, we developed a highly efficient indexing system for fast image recognition.
While we've gone a long way towards unlocking the information stored in text on the web, there's still much work to be done unlocking the information stored in pixels. This research demonstrates the feasibility of efficient computer vision techniques based on large, noisy datasets. We expect the insights we've gained will lay a useful foundation for future research in computer vision.
If you're interested to learn more about this research, check out the paper.
Now first, I want to say that this is really neat research. And the math is cool. And the people involved have every reason to be proud of themselves.
Microsoft has an image group which regularly produces equally cool results. A while back, they came up with an application that was so popular, they made it a publicly accessible toy:
Microsoft Live Labs: Photosynth
You can share or relive a vacation destination or explore a distant museum or landmark. With nothing more than a digital camera and some inspiration, you can use Photosynth to transform regular digital photos into a three-dimensional, 360-degree experience. Anybody who sees your synth is put right in your shoes, sharing in your experience, with detail, clarity and scope impossible to achieve in conventional photos or videos.
So, what properties would we expect from the composition of these services? It would find images of commonly photographed sites, and would then abstract them into 3d models of the space, with textures. All of this without sending anyone out to scan or photograph the site. Now, add a few computational generations to this, say 5 years; and people will be doing this for video.
And if you'll recall, pretty much everyone's launched facial recognition software in the past year. Apple did it, Google did it, I lose track. It still isn't great, but its pretty good, and more space/time and samples can only make it better.
But because Faster is Different, what Google and Microsoft can do today, pretty much any company can do in 3-5 years, and small organizations will be able to do in 7-8 years.
So I want you to forget about Big Brother, or Evil Corporations, or The Bad Guys. Over the next 10 years,
The Swarm is gonna map the entire world, and it is going to be able to tell, from the pixels alone, who's in any photo on the web, and where it was taken, and probably to some extent, when it was taken. You cannot stop this from happening, because as it gets cheaper, more and more organizations will independently do this themselves. This is inevitable and implied, because:
- People want to know,
- People profit from knowing, and
- Faster is Different.
Labels: faster is different, surveillance