July 12, 2014

Open-vocabulary Object Retrieval

Abstract

In this paper, we address the problem of retrieving objects based on open-vocabulary natural language queries: given a phrase describing a specific object, e.g., “the corn flakes box”, the task is to find the best match in a set of images containing candidate objects. When naming objects, humans tend to use natural language with rich semantics, including basic-level categories, fine-grained categories, and instance-level concepts such as brand names. Existing approaches to large-scale object recognition fail in this scenario, as they expect queries that map directly to a fixed set of pre-trained visual categories, e.g., ImageNet synset tags. We address this limitation by introducing a novel object retrieval method. Given a candidate object image, we first map it to a set of words that are likely to describe it, using several learned image-to-text projections. We also propose a method for handling open vocabularies, i.e., words not contained in the training data. We then compare the natural language query to the sets of words predicted for each candidate and select the best match. Our method can combine category- and instance-level semantics in a common representation. We present extensive experimental results on several datasets using both instance-level and category-level matching and show that our approach can accurately retrieve objects based on extremely varied open-vocabulary queries. The source code of our approach will be publicly available together with pre-trained models at
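
The matching step described in the abstract (compare the query to the words predicted for each candidate, then select the best match) can be illustrated with a minimal sketch. This is not the authors' implementation: the cosine similarity over bags of words, the function names `cosine` and `retrieve`, and the predicted word scores below are all assumptions made for illustration. In the paper, the word sets would come from the learned image-to-text projections.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two sparse word -> weight mappings."""
    dot = sum(weight * b.get(word, 0.0) for word, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query, candidates):
    """Return the best-matching candidate name and all similarity scores.

    `candidates` maps a candidate name to the words predicted for its image
    (word -> confidence). These predictions are assumed to be given.
    """
    query_words = Counter(query.lower().split())
    scores = {name: cosine(query_words, words) for name, words in candidates.items()}
    best = max(scores, key=scores.get)
    return best, scores


# Hypothetical predicted words for two candidate object images.
candidates = {
    "cereal_box": {"cereal": 0.9, "box": 0.8, "corn": 0.7, "flakes": 0.7},
    "ketchup_bottle": {"bottle": 0.9, "ketchup": 0.8, "red": 0.5},
}

best, scores = retrieve("the corn flakes box", candidates)
print(best, scores)
```

Exact word overlap cannot handle query words absent from the predicted sets, which is the open-vocabulary case the abstract says the method addresses separately; this sketch does not attempt that part.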

Cite this study

Guadarrama et al. (2014) studied this question.

www.synapsesocial.com/papers/6a080191c3448d68e5378b9f — DOI: https://doi.org/10.15607/rss.2014.x.041

Authors

Sergio Guadarrama

Erik Rodner

Ryan Farrell

Institutions

University of California, Berkeley

Friedrich Schiller University Jena

University of Massachusetts Lowell
