Harry Potter and the case of the digital camera
by Michael HoffmanTuesday, July 24th, 2007
If I took a picture with a digital camera and then shared that picture online, all anonymously, could you figure out it was me that took the picture?
This question came up recently when someone who got an early copy of the new Harry Potter book — Harry Potter and the Deathly Hollows — took a picture of every page and then posted those pictures to the web.
It turns out that digital cameras not only take photos. They also embed those digital images with metadata, information that you can’t see with the naked eye, but that you can read with a computer. The idea behind including other data in the photograph is that it can be useful. For example, imagine if photos had built-in geocoding (GPS) data that showed where exactly a photo was taken. With geocoding, your online photo album could list the location photos were taken automatically, and you could even connect to other people’s photos taken in the same location, for example. That would be pretty cool and you can expect it soon.
The person who took the pictures of the Harry Potter book probably didn’t realize that it left digital fingerprints behind. The information that digital cameras now capture is known as Exchangeable Image File Format (Exif) data. With some special reading software we know that the Harry Potter spoiler used a Canon Rebel 350. We know that camera was only sold in the US and Canada. And we actually know the serial number of that specific camera. So, if this person registered the camera online or took it in for service, their name is probably attached to the serial number in some database, easily accessible by the Canon and likely made available to the publisher of Harry Potter upon presentation of a subpoena.
As the TimesOnline (of London) wrote:
Every image that is taken on a digital camera contains Exif data, which holds information about the picture such as zoom, contrast, focus and ‘distance to subject’ measurements. It is typically used for ‘trouble-shooting’, so an owner can ascertain why a picture may not have worked, but it also enables a court, for instance, to establish whether a picture has been digitally altered.
“The Exif data is like the picture’s DNA; you can’t switch it off. Every image has it. Some software can be used to strip or edit the information, but you can’t edit every field,”
Experimentation with meta data
For the Electronic Frontier Foundation, all of this has serious implications for personal privacy. EFF points out that it isn’t only camera that are tracking our every move. Did you know your color laser printer is also spying on you?
Of course, digital cameras aren’t the only devices that may keep a record that could track a document back to its creator. We’ve extensively discussed how most color laser printers invisibly embed the printer serial number and date and time of printing on every page, in a pattern of tiny yellow dots. Although customers have been complaining, printer manufacturers have so far refused to let customers disable the tracking. (HP, for example, recently wrote to update one customer that it was wrong to say initially that it was unable to disable the tracking; instead, it now says it “will not” do so.)
Most computer users are unaware that CD burners in their PCs also contain a similar tracking mechanism that embeds a unique serial number, called a Recorder Identification Code, on every CD they burn.
Does it all smell of Big Brother? Or is this just the world we live in where data is everywhere all the time and we should just relax?

