The paper presents a novel learning-based sampling strategy that guarantees rejection-free sampling of the free space in both biased and uniform conditions. Data of past configurations of the autonomous system performing a repetitive task is leveraged to estimate a non-parametric probabilistic description of the region of the free space where feasible solutions of the motion planning problem are likely to be found. The tuning parameters of the kernel density estimator -- the bandwidth and the kernel -- are then used to properly alter the description of the free space such that no sampled configuration can fall outside the original free space. The paper demonstrates the proposed method on two case studies: the first showcases the sampling strategies on 2D historical data from real surface vessels, whereas the second applies the method on 3D drone data gathered from a real quadrotor system. Both instances show that the proposed biased and approximately uniform sampling schemes are able to guarantee rejection-free sampling of the considered workspaces.
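
The reason a bounded kernel can rule out rejections can be illustrated with a short sketch. It is not the paper's algorithm: it assumes a uniform-ball kernel of radius `bandwidth`, and the function name `sample_bounded_kde` is hypothetical. Sampling from such a kernel density estimate amounts to picking a recorded configuration at random and perturbing it inside a ball, so every sample lies within `bandwidth` of some past configuration. If the bandwidth does not exceed the clearance of the recorded data from the obstacles (an assumption here), no sample can leave the free space.

```python
import numpy as np


def sample_bounded_kde(data, bandwidth, n_samples, rng=None):
    """Draw samples from a KDE whose kernel is uniform on a ball.

    Illustrative assumption: the kernel has compact support of radius
    `bandwidth`, so each sample stays within that distance of a data point.
    """
    rng = np.random.default_rng() if rng is None else rng
    data = np.asarray(data, dtype=float)
    n_points, dim = data.shape

    # Step 1: choose a kernel centre (a past configuration) uniformly at random.
    centers = data[rng.integers(0, n_points, size=n_samples)]

    # Step 2: draw an offset uniformly from the ball of radius `bandwidth`.
    # A normalised Gaussian vector gives a uniform direction, and scaling
    # by U^(1/dim) gives a radius that is uniform in volume.
    directions = rng.normal(size=(n_samples, dim))
    directions /= np.linalg.norm(directions, axis=1, keepdims=True)
    radii = bandwidth * rng.random(n_samples) ** (1.0 / dim)

    return centers + directions * radii[:, None]
```

A Gaussian kernel would not give this property, since its support is unbounded and a sample can land arbitrarily far from the data.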

Related content

Sampling methods

Continuity · Discounted return · Optimizer · Reinforcement learning · Learning

November 22, 2021

Modular Deep Reinforcement Learning for Continuous Motion Planning with Temporal Logic

Mingyu Cai,Mohammadhosein Hasanbeig,Shaoping Xiao,Alessandro Abate,Zhen Kan

from arxiv, arXiv admin note: text overlap with arXiv:2010.06797

This paper investigates the motion planning of autonomous dynamical systems modeled by Markov decision processes (MDP) with unknown transition probabilities over continuous state and action spaces. Linear temporal logic (LTL) is used to specify high-level tasks over an infinite horizon, which can be converted into a limit deterministic generalized Büchi automaton (LDGBA) with several accepting sets. The novelty is to design an embedded product MDP (EP-MDP) between the LDGBA and the MDP by incorporating a synchronous tracking-frontier function to record unvisited accepting sets of the automaton, and to facilitate the satisfaction of the accepting conditions. The proposed LDGBA-based reward shaping and discounting schemes for model-free reinforcement learning (RL) depend only on the EP-MDP states and can overcome the issue of sparse rewards. Rigorous analysis shows that any RL method that optimizes the expected discounted return is guaranteed to find an optimal policy whose traces maximize the satisfaction probability. A modular deep deterministic policy gradient (DDPG) is then developed to generate such policies over continuous state and action spaces. The performance of our framework is evaluated via an array of OpenAI gym environments.

Controller · Neural Networks · Layer · Networking · state-of-the-art

November 22, 2021

BarrierNet: A Safety-Guaranteed Layer for Neural Networks

Wei Xiao,Ramin Hasani,Xiao Li,Daniela Rus

from arxiv, 23 pages

This paper introduces differentiable higher-order control barrier functions (CBF) that are end-to-end trainable together with learning systems. CBFs are usually overly conservative, while guaranteeing safety. Here, we address their conservativeness by softening their definitions using environmental dependencies without losing safety guarantees, and embed them into differentiable quadratic programs. These novel safety layers, termed a BarrierNet, can be used in conjunction with any neural network-based controller, and can be trained by gradient descent. BarrierNet allows the safety constraints of a neural controller to be adaptable to changing environments. We evaluate them on a series of control problems such as traffic merging and robot navigation in 2D and 3D space, and demonstrate their effectiveness compared to state-of-the-art approaches.
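
To make the idea of a CBF-based quadratic program concrete, here is a minimal sketch of a classical safety filter. It is not BarrierNet itself: it assumes a first-order CBF with a fixed gain `alpha`, single-integrator dynamics (the state derivative equals the control), and one circular obstacle, and the function names are hypothetical. With a single affine constraint, the program "minimize ||u - u_nom||^2 subject to a·u >= b" has a closed-form solution, so no QP solver is needed.

```python
import numpy as np


def cbf_filter(u_nom, a, b):
    """Closed-form solution of: min ||u - u_nom||^2  s.t.  a @ u >= b."""
    u_nom = np.asarray(u_nom, dtype=float)
    a = np.asarray(a, dtype=float)
    violation = b - a @ u_nom
    # If the nominal control already satisfies the constraint, keep it.
    # (A zero `a` with positive violation is infeasible; this sketch
    # simply returns the nominal control in that degenerate case.)
    if violation <= 0.0 or not np.any(a):
        return u_nom
    # Otherwise project onto the boundary of the half-space a @ u >= b.
    return u_nom + (violation / (a @ a)) * a


def safe_control(x, u_nom, center, radius, alpha=1.0):
    """Filter u_nom so a single integrator stays outside a circular obstacle.

    Barrier: h(x) = ||x - center||^2 - radius^2, safe when h(x) >= 0.
    CBF condition: grad_h(x) @ u + alpha * h(x) >= 0.
    """
    x = np.asarray(x, dtype=float)
    center = np.asarray(center, dtype=float)
    h = np.sum((x - center) ** 2) - radius ** 2
    grad_h = 2.0 * (x - center)
    return cbf_filter(u_nom, grad_h, -alpha * h)
```

In this sketch `alpha` is a hand-picked constant. The abstract's point is that such fixed definitions tend to be conservative, which is what the trainable, environment-dependent softening is meant to address.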

Learning · Episode · Extensibility · Generalization theory · Integration

November 22, 2021

Hybrid Imitative Planning with Geometric and Predictive Costs in Off-road Environments

Nitish Dashora,Daniel Shin,Dhruv Shah,Henry Leopold,David Fan,Ali Agha-Mohammadi,Nicholas Rhinehart,Sergey Levine

Geometric methods for solving open-world off-road navigation tasks, by learning occupancy and metric maps, provide good generalization but can be brittle in outdoor environments that violate their assumptions (e.g., tall grass). Learning-based methods can directly learn collision-free behavior from raw observations, but are difficult to integrate with standard geometry-based pipelines. This creates an unfortunate conflict -- either use learning and lose out on well-understood geometric navigational components, or do not use it, in favor of extensively hand-tuned geometry-based cost maps. In this work, we reject this dichotomy by designing the learning and non-learning-based components in a way such that they can be effectively combined in a self-supervised manner. Both components contribute to a planning criterion: the learned component contributes predicted traversability as rewards, while the geometric component contributes obstacle cost information. We instantiate and comparatively evaluate our system in both in-distribution and out-of-distribution environments, showing that this approach inherits complementary gains from the learned and geometric components and significantly outperforms either of them. Videos of our results are hosted at //sites.google.com/view/hybrid-imitative-planning

Integration · contrastive · Divergence · FAST · Excel

November 21, 2021

Semiexplicit Symplectic Integrators for Non-separable Hamiltonian Systems

Buddhika Jayawardana,Tomoki Ohsawa

from arxiv, 21 pages, 6 figures

We construct a symplectic integrator for non-separable Hamiltonian systems combining an extended phase space approach of Pihajoki and the symmetric projection method. The resulting method is semiexplicit in the sense that the main time evolution step is explicit whereas the symmetric projection step is implicit. The symmetric projection binds potentially diverging copies of solutions, thereby remedying the main drawback of the extended phase space approach. Moreover, our semiexplicit method is symplectic in the original phase space. This is in contrast to existing extended phase space integrators, which are symplectic only in the extended phase space. We demonstrate that our method exhibits an excellent long-time preservation of invariants, and also that it tends to be as fast as Tao's explicit modified extended phase space integrator particularly with higher-order implementations and for higher-dimensional problems.

Networking · Controller · Dynamical systems · MoDELS · Generative models

November 20, 2021

Thermodynamical Material Networks for Modeling, Planning and Control of Circular Material Flows

Federico Zocco,Beatrice Smyth,Pantelis Sopasakis

from arxiv, Perspective paper in preparation

Waste production, carbon dioxide atmospheric accumulation and dependence on finite natural resources are expressions of the unsustainability of the current industrial networks that supply fuels, energy and manufacturing products. In particular, circular manufacturing supply chains and carbon control networks are urgently needed. To model and design these and, in general, any material networks, we propose to generalize the approach used for traditional networks such as water and thermal power systems using compartmental dynamical systems thermodynamics, graph theory and the force-voltage analogy. The generalized modeling methodology is explained, then challenges and future research directions are discussed. We hope this paper inspires the use of dynamical systems and control, techniques typically used for industrial automation, for closing material flows, which is an issue of primary concern in industrial ecology and the circular economy.

Performer · Curvature · Path · Extensibility · Online

November 19, 2021

Online Coverage Planning for an Autonomous Weed Mowing Robot with Curvature Constraints

Parikshit Maini,Burak M. Gonultas,Volkan Isler

The land used for grazing cattle takes up about one-third of the land in the United States. These areas can be highly rugged. Yet, they need to be maintained to prevent weeds from taking over the nutritious grassland. This can be a daunting task especially in the case of organic farming since herbicides cannot be used. In this paper, we present the design of Cowbot, an autonomous weed mowing robot for pastures. Cowbot is an electric mower designed to operate in the rugged environments on cow pastures and provide a cost-effective method for weed control in organic farms. Path planning for the Cowbot is challenging since weed distribution on pastures is unknown. Given a limited field of view, online path planning is necessary to detect weeds and plan paths to mow them. We study the general online path planning problem for an autonomous mower with curvature and field of view constraints. We develop two online path planning algorithms that are able to utilize new information about weeds to optimize path length and ensure coverage. We deploy our algorithms on the Cowbot and perform field experiments to validate the suitability of our methods for real-time path planning. We also perform extensive simulation experiments which show that our algorithms result in up to 60 % reduction in path length as compared to baseline boustrophedon and random-search based coverage paths.

Almost surely · Performer · Controller · Optimizer · Identifiable

November 19, 2021

Safe Linear-Quadratic Dual Control with Almost Sure Performance Guarantee

Yiwen Lu,Yilin Mo

This paper considers the linear-quadratic dual control problem where the system parameters need to be identified and the control objective needs to be optimized in the meantime. Contrary to existing works on data-driven linear-quadratic regulation, which typically provide error or regret bounds within a certain probability, we propose an online algorithm that guarantees the asymptotic optimality of the controller in the almost sure sense. Our dual control strategy consists of two parts: a switched controller with time-decaying exploration noise and Markov parameter inference based on the cross-correlation between the exploration noise and system output. Central to the almost sure performance guarantee is a safe switched control strategy that falls back to a known conservative but stable controller when the actual state deviates significantly from the target state. We prove that this switching strategy rules out any potential destabilizing controllers from being applied, while the performance gap between our switching strategy and the optimal linear state feedback is exponentially small. Under our dual control scheme, the parameter inference error scales as $O(T^{-1/4+\epsilon})$, while the suboptimality gap of control performance scales as $O(T^{-1/2+\epsilon})$, where $T$ is the number of time steps, and $\epsilon$ is an arbitrarily small positive number. Simulation results on an industrial process example are provided to illustrate the effectiveness of our proposed strategy.

Object detection · Point cloud · 3D · Identifiable · INFORMS

June 21, 2021

3D Object Detection for Autonomous Driving: A Survey

Rui Qian,Xin Lai,Xirong Li

from arxiv, 3D object detection, Autonomous driving, Point clouds

Autonomous driving is regarded as one of the most promising remedies to shield human beings from severe crashes. To this end, 3D object detection serves as the core basis of such a perception system, especially for the sake of path planning, motion prediction, collision avoidance, etc. Generally, stereo or monocular images with corresponding 3D point clouds are already the standard input for 3D object detection, of which point clouds are increasingly prevalent because they provide accurate depth information. Despite existing efforts, 3D object detection on point clouds is still in its infancy due to the inherent sparseness and irregularity of point clouds, the misalignment between the camera view and the LiDAR bird's-eye view for modality synergies, occlusions and scale variations at long distances, etc. Recently, profound progress has been made in 3D object detection, with a large body of literature addressing this vision task. As such, we present a comprehensive review of the latest progress in this field covering all the main topics including sensors, fundamentals, and the recent state-of-the-art detection methods with their pros and cons. Furthermore, we introduce metrics and provide quantitative comparisons on popular public datasets. The avenues for future work are judiciously identified after an in-depth analysis of the surveyed works. Finally, we conclude this paper.

Guidance · Performer · Extensibility · Path · state-of-the-art

February 8, 2021

Path Planning using Neural A* Search

Ryo Yonetani,Tatsunori Taniai,Mohammadamin Barekatain,Mai Nishimura,Asako Kanezaki

We present Neural A*, a novel data-driven search method for path planning problems. Despite the recent increasing attention to data-driven path planning, a machine learning approach to search-based planning is still challenging due to the discrete nature of search algorithms. In this work, we reformulate a canonical A* search algorithm to be differentiable and couple it with a convolutional encoder to form an end-to-end trainable neural network planner. Neural A* solves a path planning problem by encoding a problem instance to a guidance map and then performing the differentiable A* search with the guidance map. By learning to match the search results with ground-truth paths provided by experts, Neural A* can produce a path consistent with the ground truth accurately and efficiently. Our extensive experiments confirmed that Neural A* outperformed state-of-the-art data-driven planners in terms of the search optimality and efficiency trade-off, and furthermore, successfully predicted realistic human trajectories by directly performing search-based planning on natural image inputs.
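
The role a guidance map plays in the search can be sketched with ordinary, non-differentiable A*. The code below is not Neural A*: it assumes a 4-connected grid, treats the guidance map as a non-negative cost for entering each cell, and uses a hand-written map where the paper would use one produced by a convolutional encoder. The function name `astar_with_guidance` is hypothetical.

```python
import heapq

import numpy as np


def astar_with_guidance(guidance, start, goal):
    """Classic A* on a 4-connected grid; entering cell (r, c) costs guidance[r, c].

    Assumes non-negative costs. Returns (path, cost), or (None, inf) if the
    goal cannot be reached.
    """
    guidance = np.asarray(guidance, dtype=float)
    rows, cols = guidance.shape
    min_cost = float(guidance.min())

    def heuristic(node):
        # Manhattan distance scaled by the cheapest cell cost never
        # overestimates the remaining cost, so it is admissible.
        return min_cost * (abs(node[0] - goal[0]) + abs(node[1] - goal[1]))

    open_heap = [(heuristic(start), 0.0, start)]
    best_cost = {start: 0.0}
    parent = {}

    while open_heap:
        _, cost, node = heapq.heappop(open_heap)
        if node == goal:
            # Walk the parent links back to the start to recover the path.
            path = [node]
            while node in parent:
                node = parent[node]
                path.append(node)
            return path[::-1], cost
        if cost > best_cost.get(node, np.inf):
            continue  # stale heap entry, a cheaper route was found already
        r, c = node
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if 0 <= nxt[0] < rows and 0 <= nxt[1] < cols:
                new_cost = cost + guidance[nxt]
                if new_cost < best_cost.get(nxt, np.inf):
                    best_cost[nxt] = new_cost
                    parent[nxt] = node
                    heapq.heappush(open_heap, (new_cost + heuristic(nxt), new_cost, nxt))
    return None, np.inf
```

Raising the guidance cost of a region steers the returned path away from it, which is the lever a learned encoder would adjust. Making the node selection itself differentiable, as the abstract describes, is the part this sketch leaves out.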

Simultaneous localization and mapping · SLAM · Episode · Loop · 3D

September 9, 2018

Simultaneous Localization and Mapping (SLAM) using RTAB-MAP

Sagarnil Das

from arxiv, 7 pages, 13 figures

This paper implements the Simultaneous Localization and Mapping (SLAM) technique to construct a map of a given environment. A Real-Time Appearance-Based Mapping (RTAB-Map) approach was taken to accomplish this task. Initially, a 2D occupancy grid and a 3D octomap were created from a provided simulated environment. Next, a personal simulated environment was created for mapping as well. In this appearance-based method, a process called loop closure is used to determine whether a robot has seen a location before. The paper shows that RTAB-Map is optimized for large-scale and long-term SLAM by using multiple strategies that allow loop closure to be done in real time, and the results indicate that it can be an excellent solution for SLAM when developing robots that map an environment in both 2D and 3D.