Deep Learning Architectures for 2D and 3D Scene Perception

Scene understanding is a fundamental problem in computer vision tasks, that is being more intensively explored in recent years with the development of deep learning. In this dissertation, we proposed deep learning structures to address challenges in 2D and 3D scene perception. We developed several n...

Full description

Saved in:

Bibliographic Details
Main Author:	Bao, Rina
Format:	Dissertation
Language:	English
Published:	ProQuest Dissertations & Theses 01-01-2021
Subjects:	Artificial intelligence Computer Engineering Computer science Information science Information Technology
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Abstract	Scene understanding is a fundamental problem in computer vision tasks, that is being more intensively explored in recent years with the development of deep learning. In this dissertation, we proposed deep learning structures to address challenges in 2D and 3D scene perception. We developed several novel architectures for 3D point cloud understanding at city-scale point by effectively capturing both long-range and short-range information to handle the challenging problem of large variations in object size for city-scale point cloud segmentation. GLSNet++ is a two-branch network for multiscale point cloud segmentation that models this complex problem using both global and local processing streams to capture different levels of contextual and structural 3D point cloud information. We developed PointGrad, a new graph convolution gradient operator for capturing structural relationships, that encoded point-based directional gradients into a high-dimensional multiscale tensor space. Using the PointGrad operator with graph convolution on scattered irregular point sets captures the salient structural information in the point cloud across spatial and feature scale space, enabling efficient learning. We integrated PointGrad with several deep network architectures for large-scale 3D point cloud semantic segmentation, including indoor scene and object part segmentation. In many real application areas including remote sensing and aerial imaging, the class imbalance is common and sufficient data for rare classes is hard to acquire or has high-cost associated with expert labeling. We developed MDXNet for few-shot and zero-shot learning, which emulates the human visual system by leveraging multi-domain knowledge from general visual primitives with transfer learning for more specialized learning tasks in various application domains. We extended deep learning methods in various domains, including the material domain for predicting carbon nanotube forest attributes and mechanical properties, biomedical domain for cell segmentation.
AbstractList	Scene understanding is a fundamental problem in computer vision tasks, that is being more intensively explored in recent years with the development of deep learning. In this dissertation, we proposed deep learning structures to address challenges in 2D and 3D scene perception. We developed several novel architectures for 3D point cloud understanding at city-scale point by effectively capturing both long-range and short-range information to handle the challenging problem of large variations in object size for city-scale point cloud segmentation. GLSNet++ is a two-branch network for multiscale point cloud segmentation that models this complex problem using both global and local processing streams to capture different levels of contextual and structural 3D point cloud information. We developed PointGrad, a new graph convolution gradient operator for capturing structural relationships, that encoded point-based directional gradients into a high-dimensional multiscale tensor space. Using the PointGrad operator with graph convolution on scattered irregular point sets captures the salient structural information in the point cloud across spatial and feature scale space, enabling efficient learning. We integrated PointGrad with several deep network architectures for large-scale 3D point cloud semantic segmentation, including indoor scene and object part segmentation. In many real application areas including remote sensing and aerial imaging, the class imbalance is common and sufficient data for rare classes is hard to acquire or has high-cost associated with expert labeling. We developed MDXNet for few-shot and zero-shot learning, which emulates the human visual system by leveraging multi-domain knowledge from general visual primitives with transfer learning for more specialized learning tasks in various application domains. We extended deep learning methods in various domains, including the material domain for predicting carbon nanotube forest attributes and mechanical properties, biomedical domain for cell segmentation.
Author	Bao, Rina
Author_xml	– sequence: 1 givenname: Rina surname: Bao fullname: Bao, Rina
BookMark	eNqNyr0KwjAUQOGACv71HS44C2lSm8RNrOLgIOhearzViNzUJH1_HXwApzN8Z8qG5AkHLDPKaF3kSpcmV2OWxeiunHMjJS_EhK0rxA6O2ARydIdNsA-X0KY-YITWBxAVNHQDWcHZIiGcMFjskvM0Z6O2eUXMfp2xxX532R6WXfDvHmOqn74P9KVaKK5LLlZSy_-uDzrEOGw
ContentType	Dissertation
Copyright	Database copyright ProQuest LLC; ProQuest does not claim copyright in the individual underlying works.
Copyright_xml	– notice: Database copyright ProQuest LLC; ProQuest does not claim copyright in the individual underlying works.
DBID	04Z 053 054 0BH 0NN AMEAF CBPLH EU9 G20 M8- P6D PQEST PQQKQ PQUKI
DatabaseName	Dissertations & Theses Europe Full Text: Business Dissertations & Theses Europe Full Text: Science & Technology Dissertations & Theses Europe Full Text: Social Sciences ProQuest Dissertations and Theses Professional Dissertations & Theses @ University of Missouri - Columbia ProQuest Dissertations & Theses Global: The Humanities and Social Sciences Collection ProQuest Dissertations & Theses Global: The Sciences and Engineering Collection ProQuest Dissertations & Theses A&I ProQuest Dissertations & Theses Global ProQuest Dissertations and Theses A&I: The Sciences and Engineering Collection ProQuest Dissertations and Theses A&I: The Humanities and Social Sciences Collection ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Academic ProQuest One Academic UKI Edition
DatabaseTitle	Dissertations & Theses @ University of Missouri - Columbia ProQuest Dissertations & Theses Global: The Humanities and Social Sciences Collection ProQuest One Academic Eastern Edition ProQuest Dissertations & Theses Global: The Sciences and Engineering Collection ProQuest Dissertations and Theses Professional ProQuest Dissertations and Theses A&I: The Sciences and Engineering Collection ProQuest Dissertations & Theses Global Dissertations & Theses Europe Full Text: Science & Technology Dissertations & Theses Europe Full Text: Social Sciences ProQuest One Academic UKI Edition ProQuest Dissertations and Theses A&I: The Humanities and Social Sciences Collection Dissertations & Theses Europe Full Text: Business ProQuest One Academic ProQuest Dissertations & Theses A&I
DatabaseTitleList	Dissertations & Theses @ University of Missouri - Columbia
Database_xml	– sequence: 1 dbid: G20 name: ProQuest Dissertations & Theses Global url: https://www.proquest.com/pqdtglobal1 sourceTypes: Aggregation Database
DeliveryMethod	fulltext_linktorsrc
Discipline	Computer Science
Genre	Dissertation/Thesis
GroupedDBID	04Z 053 054 0BH 0NN 8R4 8R5 AMEAF CBPLH EU9 G20 M8- P6D PQEST PQQKQ PQUKI Q2X
ID	FETCH-proquest_journals_27086025383
IEDL.DBID	G20
ISBN	9798841786917
IngestDate	Thu Oct 10 20:24:00 EDT 2024
IsPeerReviewed	false
IsScholarly	false
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-proquest_journals_27086025383
PQID	2708602538
PQPubID	18750
ParticipantIDs	proquest_journals_2708602538
PublicationCentury	2000
PublicationDate	20210101
PublicationDateYYYYMMDD	2021-01-01
PublicationDate_xml	– month: 01 year: 2021 text: 20210101 day: 01
PublicationDecade	2020
PublicationYear	2021
Publisher	ProQuest Dissertations & Theses
Publisher_xml	– name: ProQuest Dissertations & Theses
SSID	ssib000933042
Score	3.8945827
Snippet	Scene understanding is a fundamental problem in computer vision tasks, that is being more intensively explored in recent years with the development of deep...
SourceID	proquest
SourceType	Aggregation Database
SubjectTerms	Artificial intelligence Computer Engineering Computer science Information science Information Technology
Title	Deep Learning Architectures for 2D and 3D Scene Perception
URI	https://www.proquest.com/docview/2708602538
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1LSwMxEB5svYhCfeKjSkCvwWySNrseFDEtPYmgB28l2R29bWvX_n8zIat76sVzSBhI5pvJvD6AG58rlBV6rmXpqSVHcI9G8xGOgsXwxhcxmDN7Nc_vuZ3QmJz7theGyipbTIxAXS1KipHfSiOILino58PyixNrFGVXE4VGD7Zp0Fmkbui6P-1vfa-gsVw6M_m4SNRkXdiNtmQ6-K8U-7BrO0n0A9jC-hAGLT0DS9p6BHcWccnSANVP9thJGTQs-KpMWubqiikb9gTIYy-_RS7HcD2dvD3NeCvePL25Zv4nmzqBfr2o8RSYdsKNhXMZ6lIbFE7kqEWpPoQpMFPmDIabTjrfvHwBO5JqPGJIYgj979UaL6HXVOureBM_zr2XNA
link.rule.ids	312,782,786,787,11657,11697,34256,34258,44058,74582,79430
linkProvider	ProQuest
linkToHtml	http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV07T8MwED7RMoCKVJ7iUcASrBFO7NYJAwiRliBKhUQHtshOrmxpIe3_x2clkKkLs2XrJPu-O9_rA7g2ocAgR-PJIDPUksM9g0p6fexbi2GUiVwwJ3lXk48wHtKYnLu6F4bKKmtMdECdzzOKkd8EihNdktXP-8WXR6xRlF2tKDRasCmt50ElXU9N96f-re9ENJZL-iocRBU1WRN2nS0Zdf8rxS504kYSfQ82sNiHbk3PwCptPYDbGHHBqgGqn-yhkTIomfVVWRAzXeRMxHaPhTz29lvkcghXo-H0MfFq8dLqzZXpn2ziCNrFvMBjYFJzPeBa-ygzqZBrHqLkmZhxFaEv1An01p10un75EraS6es4HT9PXs5gO6B6Dxee6EF7-b3Cc2iV-erC3coPBeCaHA
linkToPdf	http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1LSwMxEB5sBRGF-sRH1YBel2Y3abPrRcR0qQ9KQQ_elmR36m1bXfv_zSxZ3VNPnkPCkGS-Seb1AdzYWGBUoA1klFsqyeGBRSWDIQ6dxbDKJrUzZ_Kqpu-xHlObnKemFobSKhtMrIG6WOTkIx9EihNdktPPwdynRcx0erf8DIhBiiKtnk6jA5uKgkFU-Nt-CjU_992EWnTJUMWjxNOUtSG4titp7z8l2oMd3Qqu78MGlgfQa2gbmNfiQ7jViEvmG6t-sPtWKKFi7g3LIs1MWTCh3RwHhWz2m_xyBNfp-O1hEjSiZv4uVtmfnOIYuuWixBNg0nAz4saEKHOpkBseo-S5mHOVYCjUKfTXrXS2fvgKttxWZC-P0-dz2I4oDaT2WvSh-_21wgvoVMXqsj6gH0jkot8
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adissertation&rft.genre=dissertation&rft.title=Deep+Learning+Architectures+for+2D+and+3D+Scene+Perception&rft.DBID=04Z%3B053%3B054%3B0BH%3B0NN%3BAMEAF%3BCBPLH%3BEU9%3BG20%3BM8-%3BP6D%3BPQEST%3BPQQKQ%3BPQUKI&rft.PQPubID=18750&rft.au=Bao%2C+Rina&rft.date=2021-01-01&rft.pub=ProQuest+Dissertations+%26+Theses&rft.isbn=9798841786917&rft.externalDBID=HAS_PDF_LINK
thumbnail_l	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9798841786917/lc.gif&client=summon&freeimage=true
thumbnail_m	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9798841786917/mc.gif&client=summon&freeimage=true
thumbnail_s	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9798841786917/sc.gif&client=summon&freeimage=true