This software can help describe what's happening in an image

Researchers from Google and Stanford University has developed software which can describe the scene shown in an image.

WHENEVER YOU SEARCH for an image, you’re normally relying on the descriptions for an image, but new developments in image-recognition software might make it easier to find the type of images you’re looking for.

A collaboration between two teams of researchers from Google and Stanford University is creating software that will help describe the scene happening in an image, instead of just individual objects.

The software teaches itself to recognise and identify entire scenes and describe it in terms that anyone could understand.

How the software accomplishes this is by using two neutral networks. The first one deals with image recognition while the second deals with natural language processing. By using computer learning, which sees it being fed a number of captioned images and learning how the sentences provided relate to what the images show.

The developments could make it easier to group and search for the billions of images and hours of videos that are available online. Currently, Google and other sites rely on written descriptions accompanying an image to figure out what it contains, but this method is able to recognise and describe them without human assistance.

That said, it’s not perfect by any stretch of the imagination. While it’s capable of being accurate, the examples provided below show that there’s still a lot of work to be done before it’s ready for the world, either making minor mistakes or getting it wrong entirely.

Screen+Shot+2014-11-17+at+2.11.11+PM Google Research Blog Google Research Blog

According to the New York Times, the two research teams said they expect to see significant increases in accuracy as they improve their software and train these programs with larger sets of annotated images.

Still, the speed in which image recognition is improving is picking up and perhaps in the near future, we will be able to upload an image or video and it will recognise what’s happening.

Read: Samsung adopts a ‘less is more’ approach for its smartphone business >

Read: Snapchat is letting you send money to friends, but don’t get excited >

Readers like you are keeping these stories free for everyone...

A mix of advertising and supporting contributions helps keep paywalls away from valuable information like this article. Over 5,000 readers like you have already stepped up and support us with a monthly payment or a once-off donation.

Learn More Support The Journal

Google Research Blog

image recognition

Author

Quinton O'Reilly

View 9 comments

Send Tip or Correction

9 Comments

Defamation

Damaging the good reputation of someone, slander, or libel.
Racism or Hate speech

An attack on an individual or group based on religion, race, gender, or beliefs.
Trolling or Off-topic

An attempt to derail the discussion.
Inappropriate language

Profanity, obscenity, vulgarity, or slurs.
Spam

Advertising, phishing, scamming, bots, or repetitive posts.
Other

description

Google

Software

News in 60 seconds