WEPSAM: Weakly Pre-Learnt Saliency Model

Avisek Lahiri; Sourya Roy; Anirban Santara; Pabitra Mitra; Prabir Kumar Biswas

doi:10.48550/arxiv.1605.01101

Back

WEPSAM: Weakly Pre-Learnt Saliency Model

Preprint

Open access

WEPSAM: Weakly Pre-Learnt Saliency Model

Avisek Lahiri, Sourya Roy, Anirban Santara, Pabitra Mitra and Prabir Kumar Biswas

ArXiv.org

Cornell University

05/03/2016

DOI: 10.48550/arxiv.1605.01101

Files and links (1)

url

https://doi.org/10.48550/arXiv.1605.01101View

Preprint (Author's original)This preprint has not been evaluated by subject experts through peer review. Preprints may undergo extensive changes and/or become peer-reviewed journal articles. Open Access

Abstract

Visual saliency detection tries to mimic human vision psychology which concentrates on sparse, important areas in natural image. Saliency prediction research has been traditionally based on low level features such as contrast, edge, etc. Recent thrust in saliency prediction research is to learn high level semantics using ground truth eye fixation datasets. In this paper we present, WEPSAM : Weakly Pre-Learnt Saliency Model as a pioneering effort of using domain specific pre-learing on ImageNet for saliency prediction using a light weight CNN architecture. The paper proposes a two step hierarchical learning, in which the first step is to develop a framework for weakly pre-training on a large scale dataset such as ImageNet which is void of human eye fixation maps. The second step refines the pre-trained model on a limited set of ground truth fixations. Analysis of loss on iSUN and SALICON datasets reveal that pre-trained network converges much faster compared to randomly initialized network. WEPSAM also outperforms some recent state-of-the-art saliency prediction models on the challenging MIT300 dataset.

Computer Science - Computer Vision and Pattern Recognition

Details

Title: Subtitle: WEPSAM: Weakly Pre-Learnt Saliency Model
Creators: Avisek Lahiri
Sourya Roy
Anirban Santara
Pabitra Mitra
Prabir Kumar Biswas
Resource Type: Preprint
Publication Details: ArXiv.org
DOI: 10.48550/arxiv.1605.01101
ISSN: 2331-8422
Publisher: Cornell University
Language: English
Date posted: 05/03/2016
Academic Unit: Computer Science
Record Identifier: 9984446723102771

Metrics

8 Record Views