Date of Award

12-31-2016

Document Type

Campus Access Thesis

Degree Name

Master of Science (MS)

Department

Computer Science

First Advisor

Marc Pomplun

Second Advisor

Dan A. Simovici

Third Advisor

Craig Yu

Abstract

Humans perceive their surroundings in great detail even though most of our visual field is reduced to low-fidelity input by the retina. In contrast, most deep learning architectures are computationally wasteful in that they treat every part of the input image as equally important to the underlying image processing task. We know that the human visual system is able to perform visual reasoning despite having only a small fovea of high visual acuity. Attention is the mechanism by which our visual input is filtered and our visual system's limited computational resources are allocated. Interest in incorporating attention mechanisms into existing deep architectures has recently been renewed. In support of such research, we investigate the extent to which existing architectures are able to learn useful feature representations from low acuity or fidelity inputs. Specifically, we apply deep autoencoders to the problem of image super-resolution and find that they can learn remarkably rich hidden representations from low-resolution inputs such as color and shape features from natural images and strokes from digit images. We find that in many cases, the learnt features are suitable for reconstructing the original high resolution image.

Comments

Free and open access to this Campus Access Thesis is made available to the UMass Boston community by ScholarWorks at UMass Boston. Those not on campus and those without a UMass Boston campus username and password may gain access to this thesis through resources like Proquest Dissertations & Theses Global or through Interlibrary Loan. If you have a UMass Boston campus username and password and would like to download this work from off-campus, click on the "Off-Campus UMass Boston Users" link above.

Share

COinS