Urban Image Analysis with Convolutional Sparse Coding

dc.contributor.advisorWonka, Peter
dc.contributor.authorAffara, Lama Ahmed
dc.contributor.committeememberHeidrich, Wolfgang
dc.contributor.committeememberGhanem, Bernard
dc.contributor.committeememberWright, John
dc.contributor.departmentComputer, Electrical and Mathematical Science and Engineering (CEMSE) Division
dc.date.accessioned2018-09-19T09:07:50Z
dc.date.available2018-09-19T09:07:50Z
dc.date.issued2018-09-18
dc.description.abstractUrban image analysis is one of the most important problems lying at the intersection of computer graphics and computer vision research. In addition, Convolutional Sparse Coding (CSC) is a well-established image representation model especially suited for image restoration tasks. This dissertation handles urban image analysis using an asset extraction framework, studies CSC for the reconstruction of both urban and general images using supervised data, and proposes a better computational approach to CSC. Our asset extraction framework uses object proposals which are currently used for increasing the computational efficiency of object detection. In this dissertation, we propose a novel adaptive pipeline for interleaving object proposals with object classification and use it as a formulation for asset detection. We first preprocess the images using a novel and efficient rectification technique. We then employ a particle filter approach to keep track of three priors, which guide proposed samples and get updated using classifier output. Tests performed on over 1000 urban images demonstrate that our rectification method is faster than existing methods without loss in quality, and that our interleaved proposal method outperforms current state-of-the-art. We further demonstrate that other methods can be improved by incorporating our interleaved proposals. We also extend the applicability of the CSC model by proposing a supervised approach to the problem, which aims at learning discriminative dictionaries instead of purely reconstructive ones. We incorporate a supervised regularization term into the traditional unsupervised CSC objective to encourage the final dictionary elements to be discriminative. Experimental results show that using supervised convolutional learning results in two key advantages. First, we learn more semantically relevant filters in the dictionary and second, we achieve improved image reconstruction on unseen data. We finally present two computational contributions to the state of the art in CSC. First, we significantly speed up the computation by proposing a new optimization framework that tackles the problem in the dual domain. Second, we extend the original formulation to higher dimensions in order to process a wider range of inputs, such as RGB images and videos. Our results show up to 20 times speedup compared to current state-of-the-art CSC solvers.
dc.identifier.citationAffara, L. A. (2018). Urban Image Analysis with Convolutional Sparse Coding. KAUST Research Repository. https://doi.org/10.25781/KAUST-CPY94
dc.identifier.doi10.25781/KAUST-CPY94
dc.identifier.urihttp://hdl.handle.net/10754/628738
dc.language.isoen
dc.person.id129117
dc.subjecturban image analysis
dc.subjectImage processing
dc.subjectComputer Vision
dc.subjectSparse Coding
dc.subjectObjects detection
dc.titleUrban Image Analysis with Convolutional Sparse Coding
dc.typeDissertation
display.details.left<span><h5>Type</h5>Dissertation<br><br><h5>Authors</h5><a href="https://repository.kaust.edu.sa/search?query=orcid.id:0000-0001-6950-5347&spc.sf=dc.date.issued&spc.sd=DESC">Affara, Lama Ahmed</a> <a href="https://orcid.org/0000-0001-6950-5347" target="_blank"><img src="https://repository.kaust.edu.sa/server/api/core/bitstreams/82a625b4-ed4b-40c8-865a-d6a5225a26a4/content" width="16" height="16"/></a><br><br><h5>Advisors</h5><a href="https://repository.kaust.edu.sa/search?query=orcid.id:0000-0003-0627-9746&spc.sf=dc.date.issued&spc.sd=DESC">Wonka, Peter</a> <a href="https://orcid.org/0000-0003-0627-9746" target="_blank"><img src="https://repository.kaust.edu.sa/server/api/core/bitstreams/82a625b4-ed4b-40c8-865a-d6a5225a26a4/content" width="16" height="16"/></a><br><br><h5>Committee Members</h5><a href="https://repository.kaust.edu.sa/search?query=orcid.id:0000-0002-4227-8508&spc.sf=dc.date.issued&spc.sd=DESC">Heidrich, Wolfgang</a> <a href="https://orcid.org/0000-0002-4227-8508" target="_blank"><img src="https://repository.kaust.edu.sa/server/api/core/bitstreams/82a625b4-ed4b-40c8-865a-d6a5225a26a4/content" width="16" height="16"/></a><br><a href="https://repository.kaust.edu.sa/search?query=orcid.id:0000-0002-5534-587X&spc.sf=dc.date.issued&spc.sd=DESC">Ghanem, Bernard</a> <a href="https://orcid.org/0000-0002-5534-587X" target="_blank"><img src="https://repository.kaust.edu.sa/server/api/core/bitstreams/82a625b4-ed4b-40c8-865a-d6a5225a26a4/content" width="16" height="16"/></a><br>Wright, John<br><br><h5>Program</h5><a href="https://repository.kaust.edu.sa/search?spc.sf=dc.date.issued&spc.sd=DESC&f.program=Computer Science,equals">Computer Science</a><br><br><h5>KAUST Department</h5><a href="https://repository.kaust.edu.sa/search?spc.sf=dc.date.issued&spc.sd=DESC&f.department=Computer, Electrical and Mathematical Science and Engineering (CEMSE) Division,equals">Computer, Electrical and Mathematical Science and Engineering (CEMSE) Division</a><br><br><h5>Date</h5>2018-09-18</span>
display.details.right<span><h5>Abstract</h5>Urban image analysis is one of the most important problems lying at the intersection of computer graphics and computer vision research. In addition, Convolutional Sparse Coding (CSC) is a well-established image representation model especially suited for image restoration tasks. This dissertation handles urban image analysis using an asset extraction framework, studies CSC for the reconstruction of both urban and general images using supervised data, and proposes a better computational approach to CSC. Our asset extraction framework uses object proposals which are currently used for increasing the computational efficiency of object detection. In this dissertation, we propose a novel adaptive pipeline for interleaving object proposals with object classification and use it as a formulation for asset detection. We first preprocess the images using a novel and efficient rectification technique. We then employ a particle filter approach to keep track of three priors, which guide proposed samples and get updated using classifier output. Tests performed on over 1000 urban images demonstrate that our rectification method is faster than existing methods without loss in quality, and that our interleaved proposal method outperforms current state-of-the-art. We further demonstrate that other methods can be improved by incorporating our interleaved proposals. We also extend the applicability of the CSC model by proposing a supervised approach to the problem, which aims at learning discriminative dictionaries instead of purely reconstructive ones. We incorporate a supervised regularization term into the traditional unsupervised CSC objective to encourage the final dictionary elements to be discriminative. Experimental results show that using supervised convolutional learning results in two key advantages. First, we learn more semantically relevant filters in the dictionary and second, we achieve improved image reconstruction on unseen data. We finally present two computational contributions to the state of the art in CSC. First, we significantly speed up the computation by proposing a new optimization framework that tackles the problem in the dual domain. Second, we extend the original formulation to higher dimensions in order to process a wider range of inputs, such as RGB images and videos. Our results show up to 20 times speedup compared to current state-of-the-art CSC solvers.<br><br><h5>Citation</h5>Affara, L. A. (2018). Urban Image Analysis with Convolutional Sparse Coding. KAUST Research Repository. https://doi.org/10.25781/KAUST-CPY94<br><br><h5>DOI</h5><a href="https://doi.org/10.25781/KAUST-CPY94">10.25781/KAUST-CPY94</a></span>
orcid.id0000-0002-5534-587X
orcid.id0000-0002-4227-8508
orcid.id0000-0003-0627-9746
orcid.id0000-0001-6950-5347
refterms.dateFOA2018-09-19T09:07:53Z
thesis.degree.disciplineComputer Science
thesis.degree.grantorKing Abdullah University of Science and Technology
thesis.degree.nameDoctor of Philosophy
Files
Original bundle
Now showing 1 - 3 of 3
Loading...
Thumbnail Image
Name:
ThesisReport_LamaAffara.pdf
Size:
10.77 MB
Format:
Adobe Portable Document Format
Description:
Dissertation
Name:
copyright form.pdf
Size:
511.07 KB
Format:
Adobe Portable Document Format
Description:
copyright
Name:
approval form.pdf
Size:
474.04 KB
Format:
Adobe Portable Document Format
Description:
Final approval form