18GAY国产小鲜肉可播放_精品国产91久久久久久久下载_亚洲人成AⅤ在线播放_国产在线精品不卡一区二区_成人亚洲午夜精品A片一区二区_永久免费观看美女全网站_国偷自产一区二区免费

Visual odometry aims to track the incremental motion of an object using the information captured by visual sensors. In this work, we study the point cloud odometry problem, where only the point cloud scans obtained by the LiDAR (Light Detection And Ranging) are used to estimate object's motion trajectory. A lightweight point cloud odometry solution is proposed and named the green point cloud odometry (GPCO) method. GPCO is an unsupervised learning method that predicts object motion by matching features of consecutive point cloud scans. It consists of three steps. First, a geometry-aware point sampling scheme is used to select discriminant points from the large point cloud. Second, the view is partitioned into four regions surrounding the object, and the PointHop++ method is used to extract point features. Third, point correspondences are established to estimate object motion between two consecutive scans. Experiments on the KITTI dataset are conducted to demonstrate the effectiveness of the GPCO method. It is observed that GPCO outperforms benchmarking deep learning methods in accuracy while it has a significantly smaller model size and less training time.

相關內容

點云

關注 48

根據(ju)激光測(ce)(ce)量(liang)(liang)原理(li)得到的(de)(de)點(dian)(dian)云，包括(kuo)(kuo)三維坐(zuo)標（XYZ）和(he)激光反(fan)射(she)強度（Intensity）。根據(ju)攝(she)影測(ce)(ce)量(liang)(liang)原理(li)得到的(de)(de)點(dian)(dian)云，包括(kuo)(kuo)三維坐(zuo)標（XYZ）和(he)顏(yan)色(se)信(xin)息（RGB）。結(jie)合(he)(he)激光測(ce)(ce)量(liang)(liang)和(he)攝(she)影測(ce)(ce)量(liang)(liang)原理(li)得到點(dian)(dian)云，包括(kuo)(kuo)三維坐(zuo)標（XYZ）、激光反(fan)射(she)強度（Intensity）和(he)顏(yan)色(se)信(xin)息（RGB）。在獲取物體(ti)表面每個采樣(yang)點(dian)(dian)的(de)(de)空間坐(zuo)標后，得到的(de)(de)是一(yi)個點(dian)(dian)的(de)(de)集合(he)(he)，稱(cheng)之為(wei)“點(dian)(dian)云”(Point Cloud)

Performer · 損失函數（機器學習） · Networking · Neural Networks · 3D ·

2022 年 2 月 8 日

SelfVoxeLO: Self-supervised LiDAR Odometry with Voxel-based Deep Neural Networks

Yan Xu,Zhaoyang Huang,Kwan-Yee Lin,Xinge Zhu,Jianping Shi,Hujun Bao,Guofeng Zhang,Hongsheng Li

from arxiv, Accepted to CoRL 2020

Recent learning-based LiDAR odometry methods have demonstrated their competitiveness. However, most methods still face two substantial challenges: 1) the 2D projection representation of LiDAR data cannot effectively encode 3D structures from the point clouds; 2) the needs for a large amount of labeled data for training limit the application scope of these methods. In this paper, we propose a self-supervised LiDAR odometry method, dubbed SelfVoxeLO, to tackle these two difficulties. Specifically, we propose a 3D convolution network to process the raw LiDAR data directly, which extracts features that better encode the 3D geometric patterns. To suit our network to self-supervised learning, we design several novel loss functions that utilize the inherent properties of LiDAR point clouds. Moreover, an uncertainty-aware mechanism is incorporated in the loss functions to alleviate the interference of moving objects/noises. We evaluate our method's performances on two large-scale datasets, i.e., KITTI and Apollo-SouthBay. Our method outperforms state-of-the-art unsupervised methods by 27%/32% in terms of translational/rotational errors on the KITTI dataset and also performs well on the Apollo-SouthBay dataset. By including more unlabelled training data, our method can further improve performance comparable to the supervised methods.

FAST · 可約的 · 傳感器 · 代價函數 · 查準率/準確率 ·

2022 年 2 月 4 日

Fast and Accurate Extrinsic Calibration for Multiple LiDARs and Cameras

Xiyuan Liu,Chongjian Yuan,Fu Zhang

from arxiv, 10 pages, 15 figures

The combination of multiple sensors is becoming necessary in robotic applications as each sensor could complement the weakness of others. Determining a precise extrinsic parameter in a fast and reliable manner between multiple sensors is essential and remains challenging. In this paper, we propose a fast, accurate, and targetless extrinsic calibration method for multiple LiDARs and cameras based on adaptive voxelization. On the theory level, we incorporate the LiDAR extrinsic calibration with the bundle adjustment method. We derive the derivatives of the cost function w.r.t. the extrinsic parameter to accelerate the optimization. On the implementation level, we apply adaptive voxelization to reduce the computation time in the process of feature correspondence matching. The robustness and accuracy of our proposed method have been verified with experiments in outdoor test scenes under multiple LiDAR-camera configurations.

點云 · 無監督 · 異常點 · Performer · Better ·

2022 年 2 月 4 日

From noisy point clouds to complete ear shapes: unsupervised pipeline

Filipa Valdeira,Ricardo Ferreira,Alessandra Micheletti,Cláudia Soares

Ears are a particularly difficult region of the human face to model, not only due to the non-rigid deformations existing between shapes but also to the challenges in processing the retrieved data. The first step towards obtaining a good model is to have complete scans in correspondence, but these usually present a higher amount of occlusions, noise and outliers when compared to most face regions, thus requiring a specific procedure. Therefore, we propose a complete pipeline taking as input unordered 3D point clouds with the aforementioned problems, and producing as output a dataset in correspondence, with completion of the missing data. We provide a comparison of several state-of-the-art registration methods and propose a new approach for one of the steps of the pipeline, with better performance for our data.

無監督 · 無監督學習 · 學成 · state-of-the-art · Pair ·

2021 年 4 月 7 日

Warp Consistency for Unsupervised Learning of Dense Correspondences

Prune Truong,Martin Danelljan,Fisher Yu,Luc Van Gool

from arxiv, code: //github.com/PruneTruong/DenseMatching

The key challenge in learning dense correspondences lies in the lack of ground-truth matches for real image pairs. While photometric consistency losses provide unsupervised alternatives, they struggle with large appearance changes, which are ubiquitous in geometric and semantic matching tasks. Moreover, methods relying on synthetic training pairs often suffer from poor generalisation to real data. We propose Warp Consistency, an unsupervised learning objective for dense correspondence regression. Our objective is effective even in settings with large appearance and view-point changes. Given a pair of real images, we first construct an image triplet by applying a randomly sampled warp to one of the original images. We derive and analyze all flow-consistency constraints arising between the triplet. From our observations and empirical results, we design a general unsupervised objective employing two of the derived constraints. We validate our warp consistency loss by training three recent dense correspondence networks for the geometric and semantic matching tasks. Our approach sets a new state-of-the-art on several challenging benchmarks, including MegaDepth, RobotCar and TSS. Code and models will be released at //github.com/PruneTruong/DenseMatching.

SGP · Performer · 優化器 · MoDELS · 描述符 ·

2021 年 3 月 4 日

Self-supervised Geometric Perception

Heng Yang,Wei Dong,Luca Carlone,Vladlen Koltun

from arxiv, CVPR 2021, Oral presentation. 8 pages main results, 19 pages in total, including references and supplementary

We present self-supervised geometric perception (SGP), the first general framework to learn a feature descriptor for correspondence matching without any ground-truth geometric model labels (e.g., camera poses, rigid transformations). Our first contribution is to formulate geometric perception as an optimization problem that jointly optimizes the feature descriptor and the geometric models given a large corpus of visual measurements (e.g., images, point clouds). Under this optimization formulation, we show that two important streams of research in vision, namely robust model fitting and deep feature learning, correspond to optimizing one block of the unknown variables while fixing the other block. This analysis naturally leads to our second contribution -- the SGP algorithm that performs alternating minimization to solve the joint optimization. SGP iteratively executes two meta-algorithms: a teacher that performs robust model fitting given learned features to generate geometric pseudo-labels, and a student that performs deep feature learning under noisy supervision of the pseudo-labels. As a third contribution, we apply SGP to two perception problems on large-scale real datasets, namely relative camera pose estimation on MegaDepth and point cloud registration on 3DMatch. We demonstrate that SGP achieves state-of-the-art performance that is on-par or superior to the supervised oracles trained using ground-truth labels.

估計/估計量 · SCAN · Extensibility · 3D · 穩健性 ·

2021 年 1 月 17 日

MultiBodySync: Multi-Body Segmentation and Motion Estimation via 3D Scan Synchronization

Jiahui Huang,He Wang,Tolga Birdal,Minhyuk Sung,Federica Arrigoni,Shi-Min Hu,Leonidas Guibas

from arxiv, Contact: huang-jh18<at>mails<dot>tsinghua<dot>edu<dot>cn

We present MultiBodySync, a novel, end-to-end trainable multi-body motion segmentation and rigid registration framework for multiple input 3D point clouds. The two non-trivial challenges posed by this multi-scan multibody setting that we investigate are: (i) guaranteeing correspondence and segmentation consistency across multiple input point clouds capturing different spatial arrangements of bodies or body parts; and (ii) obtaining robust motion-based rigid body segmentation applicable to novel object categories. We propose an approach to address these issues that incorporates spectral synchronization into an iterative deep declarative network, so as to simultaneously recover consistent correspondences as well as motion segmentation. At the same time, by explicitly disentangling the correspondence and motion segmentation estimation modules, we achieve strong generalizability across different object categories. Our extensive evaluations demonstrate that our method is effective on various datasets ranging from rigid parts in articulated objects to individually moving objects in a 3D scene, be it single-view or full point clouds.

Performer · 變換 · 目標檢測 · 無監督 · Faster R-CNN ·

2020 年 11 月 18 日

UP-DETR: Unsupervised Pre-training for Object Detection with Transformers

Zhigang Dai,Bolun Cai,Yugeng Lin,Junying Chen

Object detection with transformers (DETR) reaches competitive performance with Faster R-CNN via a transformer encoder-decoder architecture. Inspired by the great success of pre-training transformers in natural language processing, we propose a pretext task named random query patch detection to unsupervisedly pre-train DETR (UP-DETR) for object detection. Specifically, we randomly crop patches from the given image and then feed them as queries to the decoder. The model is pre-trained to detect these query patches from the original image. During the pre-training, we address two critical issues: multi-task learning and multi-query localization. (1) To trade-off multi-task learning of classification and localization in the pretext task, we freeze the CNN backbone and propose a patch feature reconstruction branch which is jointly optimized with patch detection. (2) To perform multi-query localization, we introduce UP-DETR from single-query patch and extend it to multi-query patches with object query shuffle and attention mask. In our experiments, UP-DETR significantly boosts the performance of DETR with faster convergence and higher precision on PASCAL VOC and COCO datasets. The code will be available soon.

估計/估計量 · 狀態估計 · Performer · 穩健性 · Extensibility ·

2019 年 8 月 22 日

R-LINS: A Robocentric Lidar-Inertial State Estimator for Robust and Efficient Navigation

Chao Qin,Haoyang Ye,Christian E. Pranata,Jun Han,Shuyang Zhang,Ming Liu

We present R-LINS, a lightweight robocentric lidar-inertial state estimator, which estimates robot ego-motion using a 6-axis IMU and a 3D lidar in a tightly-coupled scheme. To achieve robustness and computational efficiency even in challenging environments, an iterated error-state Kalman filter (ESKF) is designed, which recursively corrects the state via repeatedly generating new corresponding feature pairs. Moreover, a novel robocentric formulation is adopted in which we reformulate the state estimator concerning a moving local frame, rather than a fixed global frame as in the standard world-centric lidar-inertial odometry(LIO), in order to prevent filter divergence and lower computational cost. To validate generalizability and long-time practicability, extensive experiments are performed in indoor and outdoor scenarios. The results indicate that R-LINS outperforms lidar-only and loosely-coupled algorithms, and achieve competitive performance as the state-of-the-art LIO with close to an order-of-magnitude improvement in terms of speed.

點云 · 3D · 正則的 · Extensibility · 自下而上 ·

2018 年 12 月 11 日

PointRCNN: 3D Object Proposal Generation and Detection from Point Cloud

Shaoshuai Shi,Xiaogang Wang,Hongsheng Li

In this paper, we propose PointRCNN for 3D object detection from raw point cloud. The whole framework is composed of two stages: stage-1 for the bottom-up 3D proposal generation and stage-2 for refining proposals in the canonical coordinates to obtain the final detection results. Instead of generating proposals from RGB image or projecting point cloud to bird's view or voxels as previous methods do, our stage-1 sub-network directly generates a small number of high-quality 3D proposals from point cloud in a bottom-up manner via segmenting the point cloud of whole scene into foreground points and background. The stage-2 sub-network transforms the pooled points of each proposal to canonical coordinates to learn better local spatial features, which is combined with global semantic features of each point learned in stage-1 for accurate box refinement and confidence prediction. Extensive experiments on the 3D detection benchmark of KITTI dataset show that our proposed architecture outperforms state-of-the-art methods with remarkable margins by using only point cloud as input.

3D · 學成 · 監督 · INFORMS · Performer ·

2018 年 11 月 15 日

Learning to Generate and Reconstruct 3D Meshes with only 2D Supervision

Paul Henderson,Vittorio Ferrari

from arxiv, BMVC 2018 (Oral). Differentiable renderer available at //github.com/pmh47/dirt

We present a unified framework tackling two problems: class-specific 3D reconstruction from a single image, and generation of new 3D shape samples. These tasks have received considerable attention recently; however, existing approaches rely on 3D supervision, annotation of 2D images with keypoints or poses, and/or training with multiple views of each object instance. Our framework is very general: it can be trained in similar settings to these existing approaches, while also supporting weaker supervision scenarios. Importantly, it can be trained purely from 2D images, without ground-truth pose annotations, and with a single view per instance. We employ meshes as an output representation, instead of voxels used in most prior work. This allows us to exploit shading information during training, which previous 2D-supervised methods cannot. Thus, our method can learn to generate and reconstruct concave object classes. We evaluate our approach on synthetic data in various settings, showing that (i) it learns to disentangle shape from pose; (ii) using shading in the loss improves performance; (iii) our model is comparable or superior to state-of-the-art voxel-based approaches on quantitative metrics, while producing results that are visually more pleasing; (iv) it still performs well when given supervision weaker than in prior works.