亚洲男人的天堂2018av,欧美草比,久久久久久免费视频精选,国色天香在线看免费,久久久久亚洲av成人片仓井空

<dir id='q2w0v'><del id='q2w0v'><del id='q2w0v'></del><pre id='q2w0v'><pre id='q2w0v'><option id='q2w0v'><address id='q2w0v'></address><bdo id='q2w0v'><tr id='q2w0v'><acronym id='q2w0v'><pre id='q2w0v'></pre></acronym><div id='q2w0v'></div></tr></bdo></option></pre><small id='q2w0v'><address id='q2w0v'><u id='q2w0v'><legend id='q2w0v'><option id='q2w0v'><abbr id='q2w0v'></abbr><li id='q2w0v'><pre id='q2w0v'></pre></li></option></legend><select id='q2w0v'></select></u></address></small></pre></del><sup id='q2w0v'></sup><blockquote id='q2w0v'><dt id='q2w0v'></dt></blockquote><blockquote id='q2w0v'></blockquote></dir><tt id='q2w0v'></tt><u id='q2w0v'><tt id='q2w0v'><form id='q2w0v'></form></tt><td id='q2w0v'><dt id='q2w0v'></dt></td></u>

<code id='q2w0v'><i id='q2w0v'><q id='q2w0v'><legend id='q2w0v'><pre id='q2w0v'><style id='q2w0v'><acronym id='q2w0v'><i id='q2w0v'><form id='q2w0v'><option id='q2w0v'><center id='q2w0v'></center></option></form></i></acronym></style><tt id='q2w0v'></tt></pre></legend></q></i></code><center id='q2w0v'></center>

<dd id='q2w0v'></dd>

<style id='q2w0v'></style><sub id='q2w0v'><dfn id='q2w0v'><abbr id='q2w0v'><big id='q2w0v'><bdo id='q2w0v'></bdo></big></abbr></dfn></sub>_{<dir id='q2w0v'></dir>}

·

等變 · MoDELS · Agent · INTERACT · 可約的 ·

2023 年 10 月 26 日

EqDrive: Efficient Equivariant Motion Forecasting with Multi-Modality for Autonomous Driving

Yuping Wang,Jier Chen

from arxiv, 6 pages, 7 figures

Forecasting vehicular motions in autonomous driving requires a deep understanding of agent interactions and the preservation of motion equivariance under Euclidean geometric transformations. Traditional models often lack the sophistication needed to handle the intricate dynamics inherent to autonomous vehicles and the interaction relationships among agents in the scene. As a result, these models have a lower model capacity, which then leads to higher prediction errors and lower training efficiency. In our research, we employ EqMotion, a leading equivariant particle, and human prediction model that also accounts for invariant agent interactions, for the task of multi-agent vehicle motion forecasting. In addition, we use a multi-modal prediction mechanism to account for multiple possible future paths in a probabilistic manner. By leveraging EqMotion, our model achieves state-of-the-art (SOTA) performance with fewer parameters (1.2 million) and a significantly reduced training time (less than 2 hours).

相關內容

掩碼 · 潛在 · 自編碼器 · MoDELS · 可約的 ·

2023 年 12 月 13 日

LMD: Faster Image Reconstruction with Latent Masking Diffusion

Zhiyuan Ma,zhihuan yu,Jianjun Li,Bowen Zhou

As a class of fruitful approaches, diffusion probabilistic models (DPMs) have shown excellent advantages in high-resolution image reconstruction. On the other hand, masked autoencoders (MAEs), as popular self-supervised vision learners, have demonstrated simpler and more effective image reconstruction and transfer capabilities on downstream tasks. However, they all require extremely high training costs, either due to inherent high temporal-dependence (i.e., excessively long diffusion steps) or due to artificially low spatial-dependence (i.e., human-formulated high mask ratio, such as 0.75). To the end, this paper presents LMD, a faster image reconstruction framework with latent masking diffusion. First, we propose to project and reconstruct images in latent space through a pre-trained variational autoencoder, which is theoretically more efficient than in the pixel-based space. Then, we combine the advantages of MAEs and DPMs to design a progressive masking diffusion model, which gradually increases the masking proportion by three different schedulers and reconstructs the latent features from simple to difficult, without sequentially performing denoising diffusion as in DPMs or using fixed high masking ratio as in MAEs, so as to alleviate the high training time-consumption predicament. Our approach allows for learning high-capacity models and accelerate their training (by 3x or more) and barely reduces the original accuracy. Inference speed in downstream tasks also significantly outperforms the previous approaches.

ACP · 評論員 · 可約的 · 分解的 · 模型評估 ·

2023 年 12 月 12 日

How Does Perception Affect Safety: New Metrics and Strategy

Xiaotong Zhang,Jinger Chong,Kamal Youcef-Toumi

Perception serves as a critical component in the functionality of autonomous agents. However, the intricate relationship between perception metrics and robotic metrics remains unclear, leading to ambiguity in the development and fine-tuning of perception algorithms. In this paper, we introduce a methodology for quantifying this relationship, taking into account factors such as detection rate, detection quality, and latency. Furthermore, we introduce two novel metrics for Human-Robot Collaboration safety predicated upon perception metrics: Critical Collision Probability (CCP) and Average Collision Probability (ACP). To validate the utility of these metrics in facilitating algorithm development and tuning, we develop an attentive processing strategy that focuses exclusively on key input features. This approach significantly reduces computational time while preserving a similar level of accuracy. Experimental results indicate that the implementation of this strategy in an object detector leads to a maximum reduction of 30.091% in inference time and 26.534% in total time per frame. Additionally, the strategy lowers the CCP and ACP in a baseline model by 11.252% and 13.501%, respectively. The source code will be made publicly available in the final proof version of the manuscript.

大語言模型 · 語言模型化 · 端到端 · INTERACT · MoDELS ·

2023 年 12 月 12 日

LMDrive: Closed-Loop End-to-End Driving with Large Language Models

Hao Shao,Yuxuan Hu,Letian Wang,Steven L. Waslander,Yu Liu,Hongsheng Li

Despite significant recent progress in the field of autonomous driving, modern methods still struggle and can incur serious accidents when encountering long-tail unforeseen events and challenging urban scenarios. On the one hand, large language models (LLM) have shown impressive reasoning capabilities that approach "Artificial General Intelligence". On the other hand, previous autonomous driving methods tend to rely on limited-format inputs (e.g. sensor data and navigation waypoints), restricting the vehicle's ability to understand language information and interact with humans. To this end, this paper introduces LMDrive, a novel language-guided, end-to-end, closed-loop autonomous driving framework. LMDrive uniquely processes and integrates multi-modal sensor data with natural language instructions, enabling interaction with humans and navigation software in realistic instructional settings. To facilitate further research in language-based closed-loop autonomous driving, we also publicly release the corresponding dataset which includes approximately 64K instruction-following data clips, and the LangAuto benchmark that tests the system's ability to handle complex instructions and challenging driving scenarios. Extensive closed-loop experiments are conducted to demonstrate LMDrive's effectiveness. To the best of our knowledge, we're the very first work to leverage LLMs for closed-loop end-to-end autonomous driving. Codes can be found at //github.com/opendilab/LMDrive

數據集 · Extensibility · 查準率/準確率 · TOOLS · 示例 ·

2023 年 12 月 12 日

CholecTrack20: A Dataset for Multi-Class Multiple Tool Tracking in Laparoscopic Surgery

Chinedu Innocent Nwoye,Kareem Elgohary,Anvita Srinivas,Fauzan Zaid,Jo?l L. Lavanchy,Nicolas Padoy

from arxiv, Surgical tool tracking dataset paper, 15 pages, 9 figures, 4 tables

Tool tracking in surgical videos is vital in computer-assisted intervention for tasks like surgeon skill assessment, safety zone estimation, and human-machine collaboration during minimally invasive procedures. The lack of large-scale datasets hampers Artificial Intelligence implementation in this domain. Current datasets exhibit overly generic tracking formalization, often lacking surgical context: a deficiency that becomes evident when tools move out of the camera's scope, resulting in rigid trajectories that hinder realistic surgical representation. This paper addresses the need for a more precise and adaptable tracking formalization tailored to the intricacies of endoscopic procedures by introducing CholecTrack20, an extensive dataset meticulously annotated for multi-class multi-tool tracking across three perspectives representing the various ways of considering the temporal duration of a tool trajectory: (1) intraoperative, (2) intracorporeal, and (3) visibility within the camera's scope. The dataset comprises 20 laparoscopic videos with over 35,000 frames and 65,000 annotated tool instances with details on spatial location, category, identity, operator, phase, and surgical visual conditions. This detailed dataset caters to the evolving assistive requirements within a procedure.

V2X · 數據集 · 多樣性 · MoDELS · Performer ·

2023 年 12 月 12 日

DeepAccident: A Motion and Accident Prediction Benchmark for V2X Autonomous Driving

Tianqi Wang,Sukmin Kim,Wenxuan Ji,Enze Xie,Chongjian Ge,Junsong Chen,Zhenguo Li,Ping Luo

Safety is the primary priority of autonomous driving. Nevertheless, no published dataset currently supports the direct and explainable safety evaluation for autonomous driving. In this work, we propose DeepAccident, a large-scale dataset generated via a realistic simulator containing diverse accident scenarios that frequently occur in real-world driving. The proposed DeepAccident dataset includes 57K annotated frames and 285K annotated samples, approximately 7 times more than the large-scale nuScenes dataset with 40k annotated samples. In addition, we propose a new task, end-to-end motion and accident prediction, which can be used to directly evaluate the accident prediction ability for different autonomous driving algorithms. Furthermore, for each scenario, we set four vehicles along with one infrastructure to record data, thus providing diverse viewpoints for accident scenarios and enabling V2X (vehicle-to-everything) research on perception and prediction tasks. Finally, we present a baseline V2X model named V2XFormer that demonstrates superior performance for motion and accident prediction and 3D object detection compared to the single-vehicle model.

INTERACT · Integration · Pair · 置信度 · contrastive ·

2023 年 12 月 11 日

Pedestrian and Passenger Interaction with Autonomous Vehicles: Field Study in a Crosswalk Scenario

Rubén Izquierdo,Javier Alonso,Ola Benderius,Miguel ángel Sotelo,David Fernández Llorca

from arxiv, Submitted to the IEEE TIV; 13 pages, 13 figures, 7 tables. arXiv admin note: text overlap with arXiv:2307.12708

This study presents the outcomes of empirical investigations pertaining to human-vehicle interactions involving an autonomous vehicle equipped with both internal and external Human Machine Interfaces (HMIs) within a crosswalk scenario. The internal and external HMIs were integrated with implicit communication techniques, incorporating a combination of gentle and aggressive braking maneuvers within the crosswalk. Data were collected through a combination of questionnaires and quantifiable metrics, including pedestrian decision to cross related to the vehicle distance and speed. The questionnaire responses reveal that pedestrians experience enhanced safety perceptions when the external HMI and gentle braking maneuvers are used in tandem. In contrast, the measured variables demonstrate that the external HMI proves effective when complemented by the gentle braking maneuver. Furthermore, the questionnaire results highlight that the internal HMI enhances passenger confidence only when paired with the aggressive braking maneuver.

估計/估計量 · 3D · 數據集 · SimPLe · Performer ·

2023 年 12 月 11 日

PointVoxel: A Simple and Effective Pipeline for Multi-View Multi-Modal 3D Human Pose Estimation

Zhiyu Pan,Zhicheng Zhong,Wenxuan Guo,Yifan Chen,Jianjiang Feng,Jie Zhou

from arxiv, 14 pages, 10 figures

Recently, several methods have been proposed to estimate 3D human pose from multi-view images and achieved impressive performance on public datasets collected in relatively easy scenarios. However, there are limited approaches for extracting 3D human skeletons from multimodal inputs (e.g., RGB and pointcloud) that can enhance the accuracy of predicting 3D poses in challenging situations. We fill this gap by introducing a pipeline called PointVoxel that fuses multi-view RGB and pointcloud inputs to obtain 3D human poses. We demonstrate that volumetric representation is an effective architecture for integrating these different modalities. Moreover, in order to overcome the challenges of annotating 3D human pose labels in difficult scenarios, we develop a synthetic dataset generator for pretraining and design an unsupervised domain adaptation strategy so that we can obtain a well-trained 3D human pose estimator without using any manual annotations. We evaluate our approach on four datasets (two public datasets, one synthetic dataset, and one challenging dataset named BasketBall collected by ourselves), showing promising results. The code and dataset will be released soon.

BAT · 可約的 · MoDELS · 標注 · INTERACT ·

2023 年 12 月 11 日

BAT: Behavior-Aware Human-Like Trajectory Prediction for Autonomous Driving

Haicheng Liao,Zhenning Li,Huanming Shen,Wenxuan Zeng,Guofa Li,Shengbo Eben Li,Chengzhong Xu

The ability to accurately predict the trajectory of surrounding vehicles is a critical hurdle to overcome on the journey to fully autonomous vehicles. To address this challenge, we pioneer a novel behavior-aware trajectory prediction model (BAT) that incorporates insights and findings from traffic psychology, human behavior, and decision-making. Our model consists of behavior-aware, interaction-aware, priority-aware, and position-aware modules that perceive and understand the underlying interactions and account for uncertainty and variability in prediction, enabling higher-level learning and flexibility without rigid categorization of driving behavior. Importantly, this approach eliminates the need for manual labeling in the training process and addresses the challenges of non-continuous behavior labeling and the selection of appropriate time windows. We evaluate BAT's performance across the Next Generation Simulation (NGSIM), Highway Drone (HighD), Roundabout Drone (RounD), and Macao Connected Autonomous Driving (MoCAD) datasets, showcasing its superiority over prevailing state-of-the-art (SOTA) benchmarks in terms of prediction accuracy and efficiency. Remarkably, even when trained on reduced portions of the training data (25%), our model outperforms most of the baselines, demonstrating its robustness and efficiency in predicting vehicle trajectories, and the potential to reduce the amount of data required to train autonomous vehicles, especially in corner cases. In conclusion, the behavior-aware model represents a significant advancement in the development of autonomous vehicles capable of predicting trajectories with the same level of proficiency as human drivers. The project page is available at //github.com/Petrichor625/BATraj-Behavior-aware-Model.

相關系數 · MoDELS · INFORMS · 標注 · Attention ·

2023 年 12 月 10 日

MISCA: A Joint Model for Multiple Intent Detection and Slot Filling with Intent-Slot Co-Attention

Thinh Pham,Chi Tran,Dat Quoc Nguyen

from arxiv, Findings of EMNLP 2023 (//aclanthology.org/2023.findings-emnlp.841.pdf); Long paper - 10 pages; 3 figures and 3 tables

The research study of detecting multiple intents and filling slots is becoming more popular because of its relevance to complicated real-world situations. Recent advanced approaches, which are joint models based on graphs, might still face two potential issues: (i) the uncertainty introduced by constructing graphs based on preliminary intents and slots, which may transfer intent-slot correlation information to incorrect label node destinations, and (ii) direct incorporation of multiple intent labels for each token w.r.t. token-level intent voting might potentially lead to incorrect slot predictions, thereby hurting the overall performance. To address these two issues, we propose a joint model named MISCA. Our MISCA introduces an intent-slot co-attention mechanism and an underlying layer of label attention mechanism. These mechanisms enable MISCA to effectively capture correlations between intents and slot labels, eliminating the need for graph construction. They also facilitate the transfer of correlation information in both directions: from intents to slots and from slots to intents, through multiple levels of label-specific representations, without relying on token-level intent information. Experimental results show that MISCA outperforms previous models, achieving new state-of-the-art overall accuracy performances on two benchmark datasets MixATIS and MixSNIPS. This highlights the effectiveness of our attention mechanisms.

Learning · Agent · INTERACT · 深度強化學習 · motivation ·

2022 年 8 月 2 日

Deep Reinforcement Learning for Multi-Agent Interaction

Ibrahim H. Ahmed,Cillian Brewitt,Ignacio Carlucho,Filippos Christianos,Mhairi Dunion,Elliot Fosong,Samuel Garcin,Shangmin Guo,Balint Gyevnar,Trevor McInroe,Georgios Papoudakis,Arrasy Rahman,Lukas Sch?fer,Massimiliano Tamborski,Giuseppe Vecchio,Cheng Wang,Stefano V. Albrecht

from arxiv, Published in AI Communications Special Issue on Multi-Agent Systems Research in the UK

The development of autonomous agents which can interact with other agents to accomplish a given task is a core area of research in artificial intelligence and machine learning. Towards this goal, the Autonomous Agents Research Group develops novel machine learning algorithms for autonomous systems control, with a specific focus on deep reinforcement learning and multi-agent reinforcement learning. Research problems include scalable learning of coordinated agent policies and inter-agent communication; reasoning about the behaviours, goals, and composition of other agents from limited observations; and sample-efficient learning based on intrinsic motivation, curriculum learning, causal inference, and representation learning. This article provides a broad overview of the ongoing research portfolio of the group and discusses open problems for future directions.

閱讀: 0 點贊: 0

小貼士

登錄享

相關主題

北京阿比特科技有限公司

注冊地址：北京市海淀區羊坊店路18號2幢3層301-191

<tr id='B08BW'><strong id='jHQXZ'></strong><small id='e23tN'></small><button id='K5ahD'></button><li id='RFGiE'><noscript id='FtVyd'><big id='hoZQu'></big><dt id='nGk4w'></dt></noscript></li></tr><ol id='a0AWs'><option id='Z6Bfz'><table id='Oxt1H'><blockquote id='7CZ7q'><tbody id='ZVsUS'></tbody></blockquote></table></option></ol><u id='Zcm0A'></u><kbd id='dTgVC'><kbd id='RfU1P'></kbd></kbd>

<code id='V7rAF'><strong id='ETDUG'></strong></code>

<fieldset id='dKCfg'></fieldset>

<span id='zjiAK'></span>

<ins id='4SDTf'></ins>

<acronym id='cF5jN'><em id='oMoaT'></em><td id='UkYCO'><div id='3CcWk'></div></td></acronym><address id='lb9y9'><big id='MDtjP'><big id='wu9Uo'></big><legend id='V9R0L'></legend></big></address>

<i id='knkl6'><div id='pO2rM'><ins id='rzCFN'></ins></div></i>

<i id='nk8N6'></i>