
Developing learning-based methods for navigation of aerial robots is an intensive data-driven process that requires highly parallelized simulation. The full utilization of such simulators is hindered by the lack of parallelized high-level control methods that imitate the real-world robot interface. Responding to this need, we develop the Aerial Gym simulator, which can simulate millions of multirotor vehicles in parallel with nonlinear geometric controllers on the Special Euclidean group SE(3) for attitude, velocity, and position tracking. We also develop functionalities for managing a large number of obstacles in the environment, enabling rapid randomization for learning navigation tasks. In addition, we provide sample environments with robots carrying simulated cameras capable of capturing RGB, depth, segmentation, and optical flow data in obstacle-rich environments. This simulator is a step towards a currently missing, highly parallelized aerial robot simulation with geometric controllers at large scale, while also providing customizable obstacle randomization for navigation tasks. We provide training scripts with compatible reinforcement learning frameworks to navigate the robot to a goal setpoint based on attitude and velocity command interfaces. Finally, we open-source the simulator and aim to develop it further to speed up rendering using alternate kernel-based frameworks that parallelize ray-casting for depth images, thus supporting a larger number of robots.
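To make the parallelized control interface concrete, the following is a minimal sketch, assuming PyTorch tensors and illustrative gains, of how a geometric SE(3) attitude controller can be evaluated for a large batch of vehicles at once; it is not the Aerial Gym API.

```python
# Batched geometric attitude controller sketch (illustrative gains and shapes).
import torch

def vee(S: torch.Tensor) -> torch.Tensor:
    """Map a batch of skew-symmetric matrices (N, 3, 3) to vectors (N, 3)."""
    return torch.stack((S[:, 2, 1], S[:, 0, 2], S[:, 1, 0]), dim=-1)

def attitude_torque(R, R_des, omega, omega_des, kR=8.0, kOmega=2.5):
    """Geometric attitude-tracking torque for a batch of N vehicles.

    R, R_des:         (N, 3, 3) current / desired rotation matrices
    omega, omega_des: (N, 3)    current / desired body angular rates
    """
    e_R = 0.5 * vee(R_des.transpose(1, 2) @ R - R.transpose(1, 2) @ R_des)
    e_omega = omega - omega_des
    return -kR * e_R - kOmega * e_omega

# Example: one million identity attitudes tracking the identity setpoint.
N = 1_000_000
R = torch.eye(3).expand(N, 3, 3)
omega = torch.zeros(N, 3)
tau = attitude_torque(R, R, omega, omega)   # -> zeros, shape (N, 3)
```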

Related Content

A robot is any machine that simulates human behavior or thought, or that simulates other living beings (such as robot dogs, robot cats, and so on). In the narrow sense there are many classification schemes and debates over the definition of a robot, and some computer programs are even referred to as robots. In modern industry, a robot is a man-made machine that can carry out tasks automatically, used to replace or assist human work; it is typically an electromechanical device controlled by a computer program or by electronic circuitry.

Hazard models are the most commonly used tool to analyse time-to-event data. If more than one time scale is relevant for the event under study, models are required that can incorporate the dependence of a hazard along two (or more) time scales. Such models should be flexible enough to capture the joint influence of several time scales, and nonparametric smoothing techniques are obvious candidates. P-splines offer a flexible way to specify such hazard surfaces, and estimation is achieved by maximizing a penalized Poisson likelihood. Standard observation schemes, such as right-censoring and left-truncation, can be accommodated in a straightforward manner. The model can be extended to proportional hazards regression with a baseline hazard varying over two scales. Generalized linear array model (GLAM) algorithms allow efficient computations, which are implemented in a companion R package.
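As an illustration of the penalized Poisson likelihood over two time scales, here is a minimal NumPy sketch; the basis matrices are random placeholders standing in for B-spline bases, and the smoothing weights are illustrative, so this is not the companion R package.

```python
# Two-dimensional P-spline hazard smoothing: penalized Poisson objective sketch.
import numpy as np

rng = np.random.default_rng(0)

n_u, n_s = 30, 25              # grid sizes along the two time scales
k_u, k_s = 8, 6                # number of basis functions per scale
B_u = rng.random((n_u, k_u))   # placeholder for a B-spline basis over scale u
B_s = rng.random((n_s, k_s))   # placeholder for a B-spline basis over scale s

events   = rng.poisson(1.0, size=(n_u, n_s)).astype(float)  # event counts per cell
exposure = rng.random((n_u, n_s)) + 0.5                     # person-time per cell

def diff_penalty(k, order=2):
    """Squared-difference penalty matrix D'D used by P-splines."""
    D = np.diff(np.eye(k), n=order, axis=0)
    return D.T @ D

def penalized_poisson_nll(Theta, lam_u=10.0, lam_s=10.0):
    """Negative penalized Poisson log-likelihood for log-hazard eta = B_u Theta B_s'."""
    eta = B_u @ Theta @ B_s.T                 # GLAM-style row/column structure
    mu = exposure * np.exp(eta)               # expected counts
    nll = np.sum(mu - events * eta)           # Poisson terms (up to a constant)
    pen = lam_u * np.trace(Theta.T @ diff_penalty(k_u) @ Theta) \
        + lam_s * np.trace(Theta @ diff_penalty(k_s) @ Theta.T)
    return nll + pen

print(penalized_poisson_nll(np.zeros((k_u, k_s))))
```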

Deep reinforcement learning (DRL) is a promising method to learn control policies for robots only from demonstration and experience. To cover the whole dynamic behaviour of the robot, DRL training is an active exploration process typically performed in simulation environments. Although this simulation training is cheap and fast, applying DRL algorithms to real-world settings is difficult. Even if agents are trained until they perform safely in simulation, transferring them to physical systems is difficult due to the sim-to-real gap caused by the difference between the simulation dynamics and the physical robot. In this paper, we present a method for training a DRL agent online to drive autonomously on a physical vehicle by using a model-based safety supervisor. Our solution uses a supervisory system to check whether the action selected by the agent is safe and to ensure that a safe action is always implemented on the vehicle. With this, we can bypass the sim-to-real problem while training the DRL algorithm safely, quickly, and efficiently. We compare our method with conventional learning in simulation and on a physical vehicle. We provide a variety of real-world experiments where we train a small-scale vehicle online to drive autonomously with no prior simulation training. The evaluation results show that our method trains agents with improved sample efficiency while never crashing, and the trained agents demonstrate better driving performance than those trained in simulation.
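The supervisory idea can be sketched as follows; the simple kinematic look-ahead and the track-boundary constraint are our illustrative assumptions, not the paper's implementation.

```python
# Model-based safety supervisor sketch: simulate the proposed action one step
# ahead and substitute a safe fallback if the prediction violates a constraint.
from dataclasses import dataclass
import numpy as np

@dataclass
class State:
    x: float        # position along the track centre line [m]
    y: float        # lateral offset from the centre line [m]
    heading: float
    speed: float

TRACK_HALF_WIDTH = 1.0   # illustrative constraint: stay within +/- 1 m
DT = 0.05                # supervisor look-ahead step [s]

def predict(state: State, steer: float, accel: float) -> State:
    """One-step kinematic rollout used by the supervisor."""
    speed = max(0.0, state.speed + accel * DT)
    heading = state.heading + steer * DT
    return State(state.x + speed * np.cos(heading) * DT,
                 state.y + speed * np.sin(heading) * DT,
                 heading, speed)

def is_safe(state: State) -> bool:
    return abs(state.y) < TRACK_HALF_WIDTH

def supervise(state: State, agent_action, fallback=(0.0, -2.0)):
    """Return the agent's action if its predicted outcome is safe, else brake."""
    nxt = predict(state, *agent_action)
    return agent_action if is_safe(nxt) else fallback

# Example: a hard-left command near the track edge is overridden with braking.
print(supervise(State(0.0, 0.99, 0.0, 3.0), agent_action=(1.5, 0.0)))
```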

The main challenge of offline reinforcement learning, where data is limited, arises from a sequence of counterfactual reasoning dilemmas within the realm of potential actions: what if we were to choose a different course of action? These circumstances frequently give rise to extrapolation errors, which tend to accumulate exponentially with the problem horizon. Hence, it becomes crucial to acknowledge that not all decision steps are equally important to the final outcome, and to budget the number of counterfactual decisions a policy makes in order to control the extrapolation. Contrary to existing approaches that use regularization on either the policy or the value function, we propose an approach to explicitly bound the amount of out-of-distribution actions during training. Specifically, our method utilizes dynamic programming to decide where to extrapolate and where not to, with an upper bound on the number of decisions that differ from the behavior policy. It balances the potential for improvement from taking out-of-distribution actions against the risk of making errors due to extrapolation. Theoretically, we justify our method by the constrained optimality of the fixed point solution to our $Q$ updating rules. Empirically, we show that the overall performance of our method is better than the state-of-the-art offline RL methods on tasks in the widely-used D4RL benchmarks.
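A minimal tabular sketch of the budgeting idea is given below; the augmented state, the budget accounting, and the toy MDP are our simplifications, not the paper's exact updating rules.

```python
# Value iteration over (state, remaining OOD budget): deviating from the
# behavior policy's action spends one budget unit; following it is free.
import numpy as np

n_states, n_actions, budget, gamma = 5, 3, 2, 0.9
rng = np.random.default_rng(1)

P = rng.dirichlet(np.ones(n_states), size=(n_states, n_actions))  # P[s, a] -> next-state distribution
R = rng.random((n_states, n_actions))                             # reward table
behavior = rng.integers(0, n_actions, size=n_states)              # behavior policy's action per state

Q = np.zeros((n_states, budget + 1, n_actions))
for _ in range(200):
    # Greedy value of the augmented state: OOD actions allowed only while budget remains.
    V = np.full((n_states, budget + 1), -np.inf)
    for s in range(n_states):
        for b in range(budget + 1):
            for a in range(n_actions):
                if a != behavior[s] and b == 0:
                    continue                          # no budget left to extrapolate
                V[s, b] = max(V[s, b], Q[s, b, a])
    # Bellman backup: an out-of-distribution action consumes one budget unit.
    Q_new = np.empty_like(Q)
    for s in range(n_states):
        for b in range(budget + 1):
            for a in range(n_actions):
                b_next = b if a == behavior[s] else max(b - 1, 0)
                Q_new[s, b, a] = R[s, a] + gamma * P[s, a] @ V[:, b_next]
    Q = Q_new

print(Q[:, budget].argmax(axis=1))   # greedy actions when the full budget is available
```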

This paper presents an octree construction method, called Cornerstone, that facilitates global domain decomposition and interactions between particles in mesh-free numerical simulations. Our method is based on algorithms developed for 3D computer graphics, which we extend to distributed high performance computing (HPC) systems. Cornerstone yields global and locally essential octrees and is able to operate on all levels of tree hierarchies in parallel. The resulting octrees are suitable for supporting the computation of various kinds of short and long range interactions in N-body methods, such as Barnes-Hut and the Fast Multipole Method (FMM). While we provide a CPU implementation, Cornerstone may run entirely on GPUs. This results in significantly faster tree construction compared to execution on CPUs and serves as a powerful building block for the design of simulation codes that move beyond an offloading approach, where only numerically intensive tasks are dispatched to GPUs. With data residing exclusively in GPU memory, Cornerstone eliminates data movements between CPUs and GPUs. As an example, we employ Cornerstone to generate locally essential octrees for a Barnes-Hut treecode running on almost the full LUMI-G system with up to 8 trillion particles.
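A typical starting point for such parallel tree builds is a space-filling-curve ordering of the particles; the following NumPy sketch of 10-bit-per-axis Morton keys is illustrative and is not the Cornerstone implementation.

```python
# Morton (Z-order) keys: quantize positions, interleave bits, sort particles.
import numpy as np

def part1by2(v: np.ndarray) -> np.ndarray:
    """Spread the lowest 10 bits of each integer so they occupy every third bit."""
    v = v.astype(np.uint64) & 0x3FF
    v = (v | (v << 16)) & 0x030000FF
    v = (v | (v << 8)) & 0x0300F00F
    v = (v | (v << 4)) & 0x030C30C3
    v = (v | (v << 2)) & 0x09249249
    return v

def morton_keys(pos: np.ndarray) -> np.ndarray:
    """Morton keys for positions in the unit cube, shape (N, 3) -> (N,)."""
    q = np.clip((pos * 1024).astype(np.uint64), 0, 1023)   # 10 bits per axis
    return (part1by2(q[:, 0]) << 2) | (part1by2(q[:, 1]) << 1) | part1by2(q[:, 2])

rng = np.random.default_rng(0)
pos = rng.random((1_000, 3))
order = np.argsort(morton_keys(pos))    # particles laid out along the Z-curve;
pos_sorted = pos[order]                 # octree nodes then correspond to key ranges
```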

Rehabilitation training for patients with motor disabilities usually requires specialized devices in rehabilitation centers. Home-based multi-purpose training would significantly increase treatment accessibility and reduce medical costs. While it is impractical to equip homes with a full set of rehabilitation robots, we investigate the feasibility of using a general-purpose collaborative robot for rehabilitation therapies. In this work, we developed a new system for multi-purpose upper-limb rehabilitation training using a generic robot arm with human motor feedback and preference. We integrated surface electromyography, force/torque sensors, RGB-D cameras, and robot controllers with the Robot Operating System to enable sensing, communication, and control of the system. Imitation learning methods were adopted to reproduce expert-provided training trajectories that can adapt to subject capabilities and thereby facilitate in-home training. Our rehabilitation system is able to perform gross motor function and fine motor skill training with a gripper-based end-effector. We simulated system control in Gazebo and training effects (muscle activation level) in OpenSim, and evaluated its real performance with human subjects. For all the subjects enrolled, our system achieved better training outcomes compared to specialist-assisted rehabilitation under the same conditions. Our work demonstrates the potential of utilizing collaborative robots for in-home motor rehabilitation training.
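The trajectory-imitation step can be sketched as follows; the radial-basis encoding and the slow-down/amplitude adaptation rule are our illustrative assumptions, not the system's actual implementation.

```python
# Encode an expert trajectory as a weighted sum of radial basis functions,
# then replay it slower and with reduced amplitude for a less able subject.
import numpy as np

def encode(t, demo, n_basis=15, width=60.0):
    """Least-squares RBF weights reproducing the demonstrated trajectory."""
    centers = np.linspace(t.min(), t.max(), n_basis)
    Phi = np.exp(-width * (t[:, None] - centers[None, :]) ** 2)
    w, *_ = np.linalg.lstsq(Phi, demo, rcond=None)
    return centers, w

def replay(t, centers, w, width=60.0, amplitude=1.0):
    Phi = np.exp(-width * (t[:, None] - centers[None, :]) ** 2)
    return amplitude * (Phi @ w)

t = np.linspace(0.0, 1.0, 200)
demo = 0.3 * np.sin(2 * np.pi * t)            # stand-in for a recorded expert reach
centers, w = encode(t, demo)

t_slow = np.linspace(0.0, 1.0, 400)           # same phase profile, twice the duration
adapted = replay(t_slow, centers, w, amplitude=0.7)   # gentler range of motion
```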

Model-based control is preferred for robotics applications due to its systematic approach to linearizing and controlling the robot's nonlinear dynamics. The fundamental challenge in implementing a model-based controller for robotics applications is the time delay associated with the real-time computation of the robot dynamics. Due to the sequential structure of the robot's dynamic equation of motion, a multicore CPU cannot reduce the control algorithm's execution time, so a high-speed processor is required to maintain a higher sampling rate. Neural network-based modeling offers an excellent way to develop a parallel-structured equivalent of the sequential model that is suitable for parallel processing. In this paper, a deep neural network-based, parallel-structured controller for a 7 degrees-of-freedom human lower-extremity exoskeleton robot is developed. Forty-nine densely connected neurons are arranged in four layers to estimate the joint torques required for tracking trajectories. To train the deep neural network, an analytical model-based data generation technique is presented. The trained deep neural network is used for real-time joint torque prediction, and a PD controller is incorporated to mitigate the prediction errors. Simulation results show high trajectory-tracking performance. The stability of the developed controller is proven, and its robustness against parameter variation is analyzed using analysis of variance (ANOVA). A comparative study between the developed controller and the Computed Torque Controller, Model Reference Computed Torque Controller, Sliding Mode Controller, Adaptive Controller, and Linear Quadratic Regulator is presented while keeping the same robot dynamics.
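A minimal PyTorch sketch of this control structure is shown below; the layer sizes, gains, and input signals are illustrative assumptions, not the paper's exact architecture.

```python
# Feed-forward torque predictor plus PD correction for a 7-DoF exoskeleton.
import torch
import torch.nn as nn

DOF = 7

class TorqueNet(nn.Module):
    """Maps desired joint position/velocity/acceleration to feed-forward torques."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(3 * DOF, 21), nn.ReLU(),
            nn.Linear(21, 14), nn.ReLU(),
            nn.Linear(14, 7), nn.ReLU(),
            nn.Linear(7, DOF),
        )

    def forward(self, q_des, qd_des, qdd_des):
        return self.net(torch.cat((q_des, qd_des, qdd_des), dim=-1))

def control_torque(model, q, qd, q_des, qd_des, qdd_des, kp=80.0, kd=5.0):
    """Feed-forward NN torque plus a PD term that compensates prediction error."""
    tau_ff = model(q_des, qd_des, qdd_des)
    return tau_ff + kp * (q_des - q) + kd * (qd_des - qd)

model = TorqueNet()
q, qd = torch.zeros(1, DOF), torch.zeros(1, DOF)
tau = control_torque(model, q, qd, q + 0.1, qd, torch.zeros(1, DOF))
print(tau.shape)   # torch.Size([1, 7])
```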

Public and commercial organizations extensively share cyberthreat intelligence (CTI) to prepare systems to defend against existing and emerging cyberattacks. However, traditional CTI has primarily focused on tracking known threat indicators such as IP addresses and domain names, which may not provide long-term value in defending against evolving attacks. To address this challenge, we propose to use more robust threat intelligence signals called attack patterns. LADDER is a knowledge extraction framework that can extract text-based attack patterns from CTI reports at scale. The framework characterizes attack patterns by capturing the phases of an attack in Android and enterprise networks and systematically maps them to the MITRE ATT&CK pattern framework. LADDER can be used by security analysts to determine the presence of attack vectors related to existing and emerging threats, enabling them to prepare defenses proactively. We also present several use cases to demonstrate the application of LADDER in real-world scenarios. Finally, we provide a new, open-access benchmark malware dataset to train future cyberthreat intelligence models.
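The mapping step can be illustrated with a deliberately crude keyword lookup; the technique excerpt and keywords below are illustrative, and LADDER itself uses learned extraction models rather than this rule.

```python
# Map extracted attack-pattern phrases to MITRE ATT&CK techniques by keyword lookup.
keyword_map = {
    "T1566 Phishing": ("phishing", "spearphishing", "malicious attachment"),
    "T1486 Data Encrypted for Impact": ("encrypt", "ransom"),
    "T1105 Ingress Tool Transfer": ("downloads a payload", "fetches a tool"),
}

def map_phrase(phrase: str):
    """Return the ATT&CK entries whose keywords appear in an extracted phrase."""
    text = phrase.lower()
    return [tid for tid, kws in keyword_map.items() if any(k in text for k in kws)]

extracted = [
    "victims received spearphishing emails with a malicious attachment",
    "the malware encrypts user files and demands a ransom payment",
]
for phrase in extracted:
    print(phrase, "->", map_phrase(phrase))
```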

Sequential recommendation, as an emerging topic, has attracted increasing attention due to its practical significance. Models based on deep learning and attention mechanisms have achieved good performance in sequential recommendation. Recently, generative models based on the Variational Autoencoder (VAE) have shown unique advantages in collaborative filtering. In particular, the sequential VAE model, as a recurrent version of the VAE, can effectively capture temporal dependencies among items in a user sequence and perform sequential recommendation. However, VAE-based models suffer from a common limitation: the representational ability of the obtained approximate posterior distribution is limited, resulting in lower-quality generated samples. This is especially true when generating sequences. To solve this problem, we propose a novel method called Adversarial and Contrastive Variational Autoencoder (ACVAE) for sequential recommendation. Specifically, we first introduce adversarial training for sequence generation under the Adversarial Variational Bayes (AVB) framework, which enables our model to generate high-quality latent variables. Then, we employ a contrastive loss; by minimizing it, the latent variables learn more personalized and salient characteristics. Besides, when encoding the sequence, we apply a recurrent and convolutional structure to capture global and local relationships in the sequence. Finally, we conduct extensive experiments on four real-world datasets. The experimental results show that our proposed ACVAE model outperforms other state-of-the-art methods.
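The contrastive part can be sketched with a standard InfoNCE-style loss over a batch of latent codes; this is a simplified stand-in of ours, not the exact ACVAE objective.

```python
# Batch-wise contrastive loss on latent variables: each user's latent code from
# one encoding pass is pulled toward its counterpart from a second pass and
# pushed away from other users' codes.
import torch
import torch.nn.functional as F

def contrastive_loss(z1: torch.Tensor, z2: torch.Tensor, temperature: float = 0.2):
    """z1, z2: (batch, dim) latent codes for the same users under two samplings."""
    z1, z2 = F.normalize(z1, dim=-1), F.normalize(z2, dim=-1)
    logits = z1 @ z2.t() / temperature          # pairwise similarities
    targets = torch.arange(z1.size(0))          # positives are on the diagonal
    return F.cross_entropy(logits, targets)

z1, z2 = torch.randn(64, 32), torch.randn(64, 32)
print(contrastive_loss(z1, z2))
```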

GAN inversion aims to invert a given image back into the latent space of a pretrained GAN model, so that the image can be faithfully reconstructed from the inverted code by the generator. As an emerging technique to bridge the real and fake image domains, GAN inversion plays an essential role in enabling pretrained GAN models such as StyleGAN and BigGAN to be used for real image editing applications. Meanwhile, GAN inversion also provides insights into the interpretation of the GAN latent space and how realistic images can be generated. In this paper, we provide an overview of GAN inversion with a focus on its recent algorithms and applications. We cover important techniques of GAN inversion and their applications to image restoration and image manipulation. We further elaborate on some trends and challenges for future directions.
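A common optimization-based approach to inversion can be sketched as follows; the toy generator and the plain pixel loss are placeholders for a pretrained StyleGAN/BigGAN and the perceptual losses used in practice.

```python
# Optimization-based GAN inversion: freeze the generator and optimize a latent
# code so the generated image matches the target.
import torch
import torch.nn as nn

generator = nn.Sequential(nn.Linear(64, 256), nn.ReLU(),
                          nn.Linear(256, 3 * 32 * 32), nn.Tanh())
generator.requires_grad_(False)                       # stands in for a pretrained, frozen GAN

target = torch.rand(1, 3 * 32 * 32) * 2 - 1           # stand-in for the real image
z = torch.zeros(1, 64, requires_grad=True)            # latent code to be inverted
opt = torch.optim.Adam([z], lr=0.05)

for step in range(200):
    opt.zero_grad()
    loss = torch.mean((generator(z) - target) ** 2)   # pixel reconstruction loss
    loss.backward()
    opt.step()

x_rec = generator(z)   # reconstruction from the inverted code, ready for editing
```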

While existing work in robust deep learning has focused on small pixel-level $\ell_p$ norm-based perturbations, these may not account for perturbations encountered in several real-world settings. In many such cases, although test data might not be available, broad specifications about the types of perturbations (such as an unknown degree of rotation) may be known. We consider a setup where robustness is expected over an unseen test domain that is not i.i.d. but deviates from the training domain. While this deviation may not be exactly known, its broad characterization is specified a priori in terms of attributes. We propose an adversarial training approach which learns to generate new samples so as to maximize exposure of the classifier to the attribute space, without having access to data from the test domain. Our adversarial training solves a min-max optimization problem, with the inner maximization generating adversarial perturbations, and the outer minimization finding model parameters by optimizing the loss on the adversarial perturbations generated by the inner maximization. We demonstrate the applicability of our approach on three types of naturally occurring perturbations -- object-related shifts, geometric transformations, and common image corruptions. Our approach enables deep neural networks to be robust against a wide range of naturally occurring perturbations. We demonstrate the usefulness of the proposed approach by showing the robustness gains of deep neural networks trained using our adversarial training on MNIST, CIFAR-10, and a new variant of the CLEVR dataset.
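A simplified sketch of the min-max loop is shown below; where the paper learns a generator for attribute-space perturbations, the inner maximization here is a plain grid search over one attribute (rotation angle), and the classifier and data are placeholders.

```python
# Adversarial training over an attribute (rotation): inner max picks the
# worst-case angle, outer min updates the classifier on that rotated batch.
import torch
import torch.nn as nn
import torchvision.transforms.functional as TF

model = nn.Sequential(nn.Flatten(), nn.Linear(28 * 28, 10))
opt = torch.optim.SGD(model.parameters(), lr=0.1)
loss_fn = nn.CrossEntropyLoss()
angles = [-30.0, -15.0, 0.0, 15.0, 30.0]

def worst_case_rotation(x, y):
    """Inner maximization: pick the angle that maximizes the current loss."""
    with torch.no_grad():
        losses = [loss_fn(model(TF.rotate(x, a)), y) for a in angles]
    return TF.rotate(x, angles[int(torch.stack(losses).argmax())])

# One outer-minimization step on a dummy batch of 28x28 images.
x, y = torch.rand(16, 1, 28, 28), torch.randint(0, 10, (16,))
opt.zero_grad()
loss = loss_fn(model(worst_case_rotation(x, y)), y)
loss.backward()
opt.step()
```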
