Drowning in data

Paul Raven @ 01-03-2010

Maybe we’ll have flooded our culture-lungs with angry YouTube comments and pharmaceutical spamblogs before the rising sea-levels get a chance to touch our toes… [via MetaFilter]

According to one estimate, mankind created 150 exabytes (billion gigabytes) of data in 2005. This year, it will create 1,200 exabytes. Merely keeping up with this flood, and storing the bits that might be useful, is difficult enough. Analysing it, to spot patterns and extract useful information, is harder still.

Actually, I don’t see this deluge of data as a bad thing, but I’m very interested in how we’re going to store, manage and curate it.


Here today, gone tomorrow: why the next decade’s web won’t feel familiar

Paul Raven @ 07-10-2009

mosaic of Web2.0 logosPeople seem to be waking up to the impermanence of the web of late. TechDirt points us to a mainstream journalism article at the Globe & Mail, which springboards from the imminent nuking of GeoCities to worrying what will happen to all of your pictures uploaded to Facebook when it eventually (and inevitably) goes the same way. [image by jonas_therkildson]

Lately, there’s been so much discussion about the permanence of information – especially the embarrassing kind – that we have overlooked the fact that it can also disappear. At a time when we’re throwing all kinds of data and memories onto free websites, it’s a blunt reminder that the future can bring unwelcome surprises.

Ten years ago, you could have called GeoCities the garish, beating heart of the Web. It was one of the first sites that threw its doors open to users and invited them to populate its pages according to their own creativity. At a time when the Web was still daunting, it encouraged laypeople to set up their own homepages free of charge.

Kinda like the forerunner of MySpace, then, albeit (somewhat ironically) easier on the eyes and ears… and MySpace’s days are certainly (and mercifully) numbered, if the traffic figures are to be believed. But I digress…

And now, it’s curtains. GeoCities won’t disappear entirely. The Internet Archive – a non-profit foundation based in San Francisco dedicated to backing up the Web for posterity’s sake – is trying to salvage as much as it can before the deadline hits. At least one other independent group is trying to do the same. But this complicates things, because it puts GeoCities users’ data into the hands of an unaccountable third party.

Money-losing websites aren’t exactly novelties. Smaller sites flicker in and out of existence like those bugs that only have 18 hours to mate before they die. But it’s disconcerting to see a big site – one that, long ago, was one of the most popular on the Web – not just fade into obscurity, but come to its end game.

It bring to light some truths about data that are easily overlooked. Websites are like buildings: you can’t just abandon them indefinitely and expect them to keep working. For one thing, that electronic storage isn’t free. Storing files requires media that degrade and computers that fail and power that needs paying for.

The obvious answer here is to make sure you have local backups of anything stored “in the cloud” that you couldn’t bear to lose… but it’s only obvious to those with some degree of computer savvy, and (based on personal experience) everyone else is insufficiently bothered to worry about it ahead of time, no matter how patiently you try to explain the situation. If nothing else, there’ll always be good money for people who can write custom API scraping tools for defunct social networks… that business model will be the new equivalent to the photography studios places who now make their income by scanning and retouching old snapshots from the pre-digital era.

But other changes in the way we use the web are very much afoot, as pointed out by Clive Thompson at Wired. For the last decade, classic search has been the dominant internet tool, propelling Google to the top of the pyramid. But this is the age of Twitter, the temporal gateway into the “real-time web”; maybe the old surfing metaphor will finally make more sense when we’re all riding the Zeitgeist of trending topics:

For more than 10 years, Google has organized the Web by figuring out who has authority. The company measures which sites have the most links pointing to them—crucial votes of confidence—and checks to see whether a site grew to prominence slowly and organically, which tends to be a marker of quality. If a site amasses a zillion links overnight, it’s almost certainly spam.

But the real-time Web behaves in the opposite fashion. It’s all about “trending topics”—zOMG a plane crash!—which by their very nature generate a massive number of links and postings within minutes. And a search engine can’t spend days deciding what is the most crucial site or posting; people want to know immediately.

[...]

“It’s exactly what your friends are going to be talking about when you get to the bar tonight,” OneRiot executive Tobias Peggs says. “That’s what we’re finding.” Google settles arguments; real-time search starts them.

Well, at least we’re not going to be short of things to argue about. If that ever happened, the web would probably close down due to lack of interest… ;)


Ultracapacitors: the game-changer for renewable energy sources?

Paul Raven @ 21-09-2009

Sf author Karl Schroeder points us to a development that may redraw the map for renewable energy use. EEStor, long suspected by some to be the sort of vapourware company that spends a few years making big promises before dissolving in a puff of evaporating venture capital, are believed to have applied for certification of their ultracapacitor technology.

There’s a Wikipedia page on ultracapacitors, which have existed for some time in smaller form, but here’s Schroeder’s summary:

… the ultimate in electricity-storage technology:  a device capable of running your car for hundreds of miles on one charge, and of recharging in under five minutes.  A device that is not a battery, and hence never wears out.  A technology that would make intermittent power generation sources such as windmills directly competitive with baseload generation sources such as coal.

Sounds great, doesn’t it? As pointed out by Schroeder, there’s a great deal of justifiable skepticism around the technology in general and the EEStor news in particular – snake-oil is still a thriving business in the information age, after all. But signs suggest we’ll find out the truth behind the speculation pretty soon… and if we dare to hope that this is the real thing, perhaps we’re about to see what Schroeder calls “a truly disruptive change [...] nothing less than the first nail in the coffin of the fossil fuel age.”


Moore’s Law gets a new lease of life

Paul Raven @ 22-02-2009

digital camera CCD chipGood news for Kurzweilian Singularitarians and flop-junkies – Moore’s Law has been looking increasingly likely to derail as we approach the lowest practical limit for semiconductor miniaturization, but newly announced research means there’s life in the old dog yet:

Two US groups have announced transistors almost 1000 times smaller than those in use today, and a [nano-scale magnet-based] version of flash memory that could store all the books in the US Library of Congress in a square 4 inches (10 cm) across.

[...]

Using 3-nanometre magnets, an array could store 10 terabits (roughly 270 standard DVDs) per square inch, says Russell, who is now working to perfect magnets small enough to cram 100 terabits into a square inch.

“Currently, industry is working at half a terabit [per square inch],” he says. “They wanted to be at 10 terabits in a few years’ time – we have leapfrogged that target.”

If this were Engadget, we could squee about how we’ll have laptops the size of wristwatches by the end of the decade, but that would be to miss an important point. The ever-falling cost and size of memory and processing power will certainly mean more gadgets, but those gadgets will bring social changes along with them – as Charlie Stross pointed out a while ago, if you can read and write data at the atomic scale then physical storage capacity becomes a complete non-issue, allowing you to record everything – literally everything. [image by Fox O'Rian]

When you can record everything, how do you go about managing and using what you’ve recorded?


Thermal memory data storage

Tom James @ 07-01-2009

lavaWe’ve had magnetic memory, semiconductor memory, and memristors: now we have thermal memory with the attendent field of study phononics:

In the current study, Wang and Li take the field of phononics one step further and show the feasibility of a thermal memory that can store data with heat. The scientists predict that such a heat memory could be experimentally realized in the foreseeable future with rapidly advancing nanotechnology. Their work is published in a recent issue of Physical Review Letters.

It seems that just about anything can be turned into a computer or computer component.

[from Physorg][image from sah5515 on flickr]


Next Page »