Emerging Technology from the arXiv


The Video Genome Unveiled

The powerful technique behind DNA sequencing can now be used to mine video databases. The key is a new idea known as the video genome.

  • March 31, 2010

Bioinformatics has grown from an obscure branch of computer science to the powerhouse behind molecular biology in just 30 years. In particular, the technique behind gene sequencing has been hugely influential, at least in biology. While all kinds of disciplines such as computer science and mathematics have contributed to its development, bioinformatics has yet to repay the favour.

Today that looks to have changed with an announcement by Alexander Bronstein and buddies at BBK Technologies based near Boston. They’ve found a way to use the technique behind DNA sequencing to match video sequences.

“The problems encountered in video analysis such as identifying a video in a large database, putting together video fragments, finding similarities and common ancestry between different versions of a video, have analogous counterpart problems in genetic research and analysis of DNA and protein sequences,” they say.

Bronstein and co begin by creating “video DNA” for movies. This is information in the form of a sequence of letters that they use to label each scene in a film. The trick, of course, is to find a way to generate this sequence from the scene itself. They do this by looking for qualities of the image that are invariant under any reasonable transformation, such as the addition of subtitles or a change in colour cast.

To help, Bronstein and co have developed a piece of software that they’ve trained to find these invariant features, which turn out largely to be things like the relative position of shapes and features in the image.
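As a rough illustration of how a scene might become a letter, here is a minimal sketch. It is not the authors' method: the two-dimensional feature vectors, the four-letter alphabet and the prototype values are all made-up assumptions. The idea is only that each scene's invariant features get snapped to the nearest entry in a small "vocabulary", so that small distortions still produce the same letter.

```python
import math

# Hypothetical "visual vocabulary": each letter stands for one prototype
# feature vector. A real system would learn these; the values here are made up.
PROTOTYPES = {
    "A": (0.1, 0.1),
    "C": (0.9, 0.1),
    "G": (0.1, 0.9),
    "T": (0.9, 0.9),
}

def nearest_letter(feature):
    """Return the letter whose prototype is closest to the feature vector."""
    return min(PROTOTYPES, key=lambda letter: math.dist(feature, PROTOTYPES[letter]))

def video_dna(scene_features):
    """Turn a list of per-scene feature vectors into a string of letters."""
    return "".join(nearest_letter(f) for f in scene_features)

# Two slightly different versions of the same three scenes (say, one with a
# colour cast) should map to the same string.
original = [(0.0, 0.2), (1.0, 0.0), (0.8, 0.95)]
distorted = [(0.05, 0.25), (0.95, 0.05), (0.85, 0.9)]
print(video_dna(original), video_dna(distorted))
```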

The task of matching identical pieces of film is then to analyse each scene in a film, generate its video DNA and then look for other pieces of film with the same DNA.
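In bioinformatics, this kind of lookup is usually done with sequence alignment rather than exact string comparison, so that a few changed or missing letters do not spoil a match. The sketch below uses the standard Smith-Waterman local alignment score as a stand-in. Whether Bronstein and co use this particular algorithm is an assumption, as are the scoring values and the `best_match` helper.

```python
def local_alignment_score(a, b, match=2, mismatch=-1, gap=-1):
    """Smith-Waterman score of the best locally matching stretch of a and b."""
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A cell never drops below zero, which is what makes the alignment local.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best

def best_match(query, database):
    """Return the name of the database entry that aligns best with the query."""
    return max(database, key=lambda name: local_alignment_score(query, database[name]))

# Hypothetical database of video DNA strings keyed by film name.
database = {"film_a": "ACGTTGCA", "film_b": "TTTTCCCC"}
print(best_match("GTTG", database))
```

The quadratic cost of comparing a clip against every film this way is one reason real sequence databases rely on indexing and heuristics, and a video database would presumably need the same.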

To recreate the whole film, the software simply glues fragments of video together wherever the end of one fragment's DNA matches the start of another's, just as in DNA sequencing.
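The gluing step resembles the overlap-based assembly used to stitch short DNA reads into longer sequences. Here is a minimal greedy version, assuming fragments are plain strings of video DNA and that an overlap of at least three letters counts as genuine. The function names and the threshold are illustrative choices, not taken from the paper.

```python
def overlap(a, b, min_len=3):
    """Length of the longest suffix of a that equals a prefix of b."""
    for n in range(min(len(a), len(b)), min_len - 1, -1):
        if a[-n:] == b[:n]:
            return n
    return 0

def assemble(fragments, min_len=3):
    """Greedily merge the pair of fragments with the largest overlap."""
    frags = list(fragments)
    while len(frags) > 1:
        best_n, best_i, best_j = 0, None, None
        for i, a in enumerate(frags):
            for j, b in enumerate(frags):
                if i != j:
                    n = overlap(a, b, min_len)
                    if n > best_n:
                        best_n, best_i, best_j = n, i, j
        if best_n == 0:
            break  # no remaining overlaps; leave the fragments separate
        merged = frags[best_i] + frags[best_j][best_n:]
        frags = [f for k, f in enumerate(frags) if k not in (best_i, best_j)]
        frags.append(merged)
    return frags

# Three overlapping clips of one hypothetical film.
print(assemble(["ACGTAC", "TACGGA", "GGATTC"]))
```

Greedy merging is simple but can be fooled by repeated scenes, much as repeats trip up genome assemblers.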

That’s a clever idea. Video databases are not so different from DNA databases in terms of the amount of information they hold. So using the techniques of bioinformatics to mine them makes good sense.

However, just how well and how quickly their algorithm works in the real world will be important to ascertain. And they’ll surely face competition in producing the best algorithms for this purpose.

Bronstein and co say that their technique could eventually “have an impact similar to that of the Human Genome project in genomic research”. That may be overstating things a little, but the video genome idea will certainly appeal to a number of industries, not least those wanting to crack down on video piracy.

Ref: arxiv.org/abs/1003.5320: The Video Genome
