FCAF3D: Fully Convolutional Anchor-Free 3D Object Detection
Recently, promising applications in robotics and augmented reality have attracted considerable attention to 3D object detection from point clouds. In this paper, we present FCAF3D - a first-in-class fully convolutional anchor-free indoor 3D object detection method. It is a simple yet effective metho...
Saved in:
Main Authors: | , , |
---|---|
Format: | Journal Article |
Language: | English |
Published: |
01-12-2021
|
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Abstract | Recently, promising applications in robotics and augmented reality have
attracted considerable attention to 3D object detection from point clouds. In
this paper, we present FCAF3D - a first-in-class fully convolutional
anchor-free indoor 3D object detection method. It is a simple yet effective
method that uses a voxel representation of a point cloud and processes voxels
with sparse convolutions. FCAF3D can handle large-scale scenes with minimal
runtime through a single fully convolutional feed-forward pass. Existing 3D
object detection methods make prior assumptions on the geometry of objects, and
we argue that it limits their generalization ability. To get rid of any prior
assumptions, we propose a novel parametrization of oriented bounding boxes that
allows obtaining better results in a purely data-driven way. The proposed
method achieves state-of-the-art 3D object detection results in terms of
mAP@0.5 on ScanNet V2 (+4.5), SUN RGB-D (+3.5), and S3DIS (+20.5) datasets. The
code and models are available at https://github.com/samsunglabs/fcaf3d. |
---|---|
AbstractList | Recently, promising applications in robotics and augmented reality have
attracted considerable attention to 3D object detection from point clouds. In
this paper, we present FCAF3D - a first-in-class fully convolutional
anchor-free indoor 3D object detection method. It is a simple yet effective
method that uses a voxel representation of a point cloud and processes voxels
with sparse convolutions. FCAF3D can handle large-scale scenes with minimal
runtime through a single fully convolutional feed-forward pass. Existing 3D
object detection methods make prior assumptions on the geometry of objects, and
we argue that it limits their generalization ability. To get rid of any prior
assumptions, we propose a novel parametrization of oriented bounding boxes that
allows obtaining better results in a purely data-driven way. The proposed
method achieves state-of-the-art 3D object detection results in terms of
mAP@0.5 on ScanNet V2 (+4.5), SUN RGB-D (+3.5), and S3DIS (+20.5) datasets. The
code and models are available at https://github.com/samsunglabs/fcaf3d. |
Author | Konushin, Anton Rukhovich, Danila Vorontsova, Anna |
Author_xml | – sequence: 1 givenname: Danila surname: Rukhovich fullname: Rukhovich, Danila – sequence: 2 givenname: Anna surname: Vorontsova fullname: Vorontsova, Anna – sequence: 3 givenname: Anton surname: Konushin fullname: Konushin, Anton |
BackLink | https://doi.org/10.48550/arXiv.2112.00322$$DView paper in arXiv |
BookMark | eNotj7FOwzAURT3AUAofwFT_QILz7Dg2TFFCClKlLt0jx3kWQcZGJq3o3xMK01murs65IVchBiTkvmC5UGXJHkz6nk45FAXkjHGAFXnqmrrj7SPtjt6faRPDKfrjPMVgPK2DfYsp6xIi5S3dD-9oZ9rivGBZ3JJrZ_wX3v1zTQ7d86F5yXb77WtT7zIjK8jGgY9aSVBaM-cqq40epba2sgyMABROKCuVlEZKdBpwEa2ckLocFgDyNdn83V7s-880fZh07n8r-ksF_wHff0HV |
ContentType | Journal Article |
Copyright | http://arxiv.org/licenses/nonexclusive-distrib/1.0 |
Copyright_xml | – notice: http://arxiv.org/licenses/nonexclusive-distrib/1.0 |
DBID | AKY GOX |
DOI | 10.48550/arxiv.2112.00322 |
DatabaseName | arXiv Computer Science arXiv.org |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: GOX name: arXiv.org url: http://arxiv.org/find sourceTypes: Open Access Repository |
DeliveryMethod | fulltext_linktorsrc |
ExternalDocumentID | 2112_00322 |
GroupedDBID | AKY GOX |
ID | FETCH-LOGICAL-a672-db3d98628990ff7c9a9d69cc7c02a42e4f48c6866a66ef92e5507f4695b7f42e3 |
IEDL.DBID | GOX |
IngestDate | Mon Jan 08 05:39:32 EST 2024 |
IsDoiOpenAccess | true |
IsOpenAccess | true |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-a672-db3d98628990ff7c9a9d69cc7c02a42e4f48c6866a66ef92e5507f4695b7f42e3 |
OpenAccessLink | https://arxiv.org/abs/2112.00322 |
ParticipantIDs | arxiv_primary_2112_00322 |
PublicationCentury | 2000 |
PublicationDate | 2021-12-01 |
PublicationDateYYYYMMDD | 2021-12-01 |
PublicationDate_xml | – month: 12 year: 2021 text: 2021-12-01 day: 01 |
PublicationDecade | 2020 |
PublicationYear | 2021 |
Score | 1.8277438 |
SecondaryResourceType | preprint |
Snippet | Recently, promising applications in robotics and augmented reality have
attracted considerable attention to 3D object detection from point clouds. In
this... |
SourceID | arxiv |
SourceType | Open Access Repository |
SubjectTerms | Computer Science - Computer Vision and Pattern Recognition |
Title | FCAF3D: Fully Convolutional Anchor-Free 3D Object Detection |
URI | https://arxiv.org/abs/2112.00322 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdZ07T8MwEMdPtBMLAgEqT3lgjXCdxLFhqpKGTnSgQ7fIT4kloPQh-PY9O0GwMFmyb_HZsu9nn_8GeBDIsV6miKlFENXmzibaCJWwTFBv81xQHR4KL96K17Wo5kEmh_y8hVHd1_u-1wfWm0ekk6CniZNuBCPGQsrWy3LdX05GKa7B_tcOY8xY9WeTqE_hZIjuyKwfjjM4cu05PNflrE6rJxJw75uUH-1-GO9g2uL60yV15xxJK7LU4VyEVG4bU6TaC1jV81W5SIY_CxLFC5ZYnVqJkIAUQ70vjFTScmlMYShTGXOZz4ThgnPFufOSuSAn5hFRc40Fc-kljBH73QQItxiceC01FR4Ziko1DdLw-ZR73OR1fgWT2NPms5elaIITmuiE6_-bbuCYhayMmJBxC-Ntt3N3MNrY3X307QH2RnRq |
link.rule.ids | 228,230,782,887 |
linkProvider | Cornell University |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=FCAF3D%3A+Fully+Convolutional+Anchor-Free+3D+Object+Detection&rft.au=Rukhovich%2C+Danila&rft.au=Vorontsova%2C+Anna&rft.au=Konushin%2C+Anton&rft.date=2021-12-01&rft_id=info:doi/10.48550%2Farxiv.2112.00322&rft.externalDocID=2112_00322 |