FCAF3D: Fully Convolutional Anchor-Free 3D Object Detection

Recently, promising applications in robotics and augmented reality have attracted considerable attention to 3D object detection from point clouds. In this paper, we present FCAF3D - a first-in-class fully convolutional anchor-free indoor 3D object detection method. It is a simple yet effective metho...

Full description

Saved in:
Bibliographic Details
Main Authors: Rukhovich, Danila, Vorontsova, Anna, Konushin, Anton
Format: Journal Article
Language:English
Published: 01-12-2021
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Abstract Recently, promising applications in robotics and augmented reality have attracted considerable attention to 3D object detection from point clouds. In this paper, we present FCAF3D - a first-in-class fully convolutional anchor-free indoor 3D object detection method. It is a simple yet effective method that uses a voxel representation of a point cloud and processes voxels with sparse convolutions. FCAF3D can handle large-scale scenes with minimal runtime through a single fully convolutional feed-forward pass. Existing 3D object detection methods make prior assumptions on the geometry of objects, and we argue that it limits their generalization ability. To get rid of any prior assumptions, we propose a novel parametrization of oriented bounding boxes that allows obtaining better results in a purely data-driven way. The proposed method achieves state-of-the-art 3D object detection results in terms of mAP@0.5 on ScanNet V2 (+4.5), SUN RGB-D (+3.5), and S3DIS (+20.5) datasets. The code and models are available at https://github.com/samsunglabs/fcaf3d.
AbstractList Recently, promising applications in robotics and augmented reality have attracted considerable attention to 3D object detection from point clouds. In this paper, we present FCAF3D - a first-in-class fully convolutional anchor-free indoor 3D object detection method. It is a simple yet effective method that uses a voxel representation of a point cloud and processes voxels with sparse convolutions. FCAF3D can handle large-scale scenes with minimal runtime through a single fully convolutional feed-forward pass. Existing 3D object detection methods make prior assumptions on the geometry of objects, and we argue that it limits their generalization ability. To get rid of any prior assumptions, we propose a novel parametrization of oriented bounding boxes that allows obtaining better results in a purely data-driven way. The proposed method achieves state-of-the-art 3D object detection results in terms of mAP@0.5 on ScanNet V2 (+4.5), SUN RGB-D (+3.5), and S3DIS (+20.5) datasets. The code and models are available at https://github.com/samsunglabs/fcaf3d.
Author Konushin, Anton
Rukhovich, Danila
Vorontsova, Anna
Author_xml – sequence: 1
  givenname: Danila
  surname: Rukhovich
  fullname: Rukhovich, Danila
– sequence: 2
  givenname: Anna
  surname: Vorontsova
  fullname: Vorontsova, Anna
– sequence: 3
  givenname: Anton
  surname: Konushin
  fullname: Konushin, Anton
BackLink https://doi.org/10.48550/arXiv.2112.00322$$DView paper in arXiv
BookMark eNotj7FOwzAURT3AUAofwFT_QILz7Dg2TFFCClKlLt0jx3kWQcZGJq3o3xMK01murs65IVchBiTkvmC5UGXJHkz6nk45FAXkjHGAFXnqmrrj7SPtjt6faRPDKfrjPMVgPK2DfYsp6xIi5S3dD-9oZ9rivGBZ3JJrZ_wX3v1zTQ7d86F5yXb77WtT7zIjK8jGgY9aSVBaM-cqq40epba2sgyMABROKCuVlEZKdBpwEa2ckLocFgDyNdn83V7s-880fZh07n8r-ksF_wHff0HV
ContentType Journal Article
Copyright http://arxiv.org/licenses/nonexclusive-distrib/1.0
Copyright_xml – notice: http://arxiv.org/licenses/nonexclusive-distrib/1.0
DBID AKY
GOX
DOI 10.48550/arxiv.2112.00322
DatabaseName arXiv Computer Science
arXiv.org
DatabaseTitleList
Database_xml – sequence: 1
  dbid: GOX
  name: arXiv.org
  url: http://arxiv.org/find
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
ExternalDocumentID 2112_00322
GroupedDBID AKY
GOX
ID FETCH-LOGICAL-a672-db3d98628990ff7c9a9d69cc7c02a42e4f48c6866a66ef92e5507f4695b7f42e3
IEDL.DBID GOX
IngestDate Mon Jan 08 05:39:32 EST 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-a672-db3d98628990ff7c9a9d69cc7c02a42e4f48c6866a66ef92e5507f4695b7f42e3
OpenAccessLink https://arxiv.org/abs/2112.00322
ParticipantIDs arxiv_primary_2112_00322
PublicationCentury 2000
PublicationDate 2021-12-01
PublicationDateYYYYMMDD 2021-12-01
PublicationDate_xml – month: 12
  year: 2021
  text: 2021-12-01
  day: 01
PublicationDecade 2020
PublicationYear 2021
Score 1.8277438
SecondaryResourceType preprint
Snippet Recently, promising applications in robotics and augmented reality have attracted considerable attention to 3D object detection from point clouds. In this...
SourceID arxiv
SourceType Open Access Repository
SubjectTerms Computer Science - Computer Vision and Pattern Recognition
Title FCAF3D: Fully Convolutional Anchor-Free 3D Object Detection
URI https://arxiv.org/abs/2112.00322
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdZ07T8MwEMdPtBMLAgEqT3lgjXCdxLFhqpKGTnSgQ7fIT4kloPQh-PY9O0GwMFmyb_HZsu9nn_8GeBDIsV6miKlFENXmzibaCJWwTFBv81xQHR4KL96K17Wo5kEmh_y8hVHd1_u-1wfWm0ekk6CniZNuBCPGQsrWy3LdX05GKa7B_tcOY8xY9WeTqE_hZIjuyKwfjjM4cu05PNflrE6rJxJw75uUH-1-GO9g2uL60yV15xxJK7LU4VyEVG4bU6TaC1jV81W5SIY_CxLFC5ZYnVqJkIAUQ70vjFTScmlMYShTGXOZz4ThgnPFufOSuSAn5hFRc40Fc-kljBH73QQItxiceC01FR4Ziko1DdLw-ZR73OR1fgWT2NPms5elaIITmuiE6_-bbuCYhayMmJBxC-Ntt3N3MNrY3X307QH2RnRq
link.rule.ids 228,230,782,887
linkProvider Cornell University
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=FCAF3D%3A+Fully+Convolutional+Anchor-Free+3D+Object+Detection&rft.au=Rukhovich%2C+Danila&rft.au=Vorontsova%2C+Anna&rft.au=Konushin%2C+Anton&rft.date=2021-12-01&rft_id=info:doi/10.48550%2Farxiv.2112.00322&rft.externalDocID=2112_00322