精品自在线观看影片天天看,老司机国内精品久久久久精品,欧美成人Aⅴ视频网站,在线视频成人动漫,精品国产午夜一区二区三区

Scene transfer for vision-based mobile robotics applications is a highly relevant and challenging problem. The utility of a robot greatly depends on its ability to perform a task in the real world, outside of a well-controlled lab environment. Existing scene transfer end-to-end policy learning approaches often suffer from poor sample efficiency or limited generalization capabilities, making them unsuitable for mobile robotics applications. This work proposes an adaptive multi-pair contrastive learning strategy for visual representation learning that enables zero-shot scene transfer and real-world deployment. Control policies relying on the embedding are able to operate in unseen environments without the need for finetuning in the deployment environment. We demonstrate the performance of our approach on the task of agile, vision-based quadrotor flight. Extensive simulation and real-world experiments demonstrate that our approach successfully generalizes beyond the training domain and outperforms all baselines.

相關內容

Learning

關注 0

SOFT · Weight · 控制器 · 可約的 · 設計 ·

2023 年 11 月 1 日

A Modular Pneumatic Soft Gripper Design for Aerial Grasping and Landing

Hiu Ching Cheung,Ching-Wei Chang,Bailun Jiang,Chih-Yung Wen,Henry K. Chu

from arxiv, 7 pages, 13 figures, submitted to IEEE RoboSoft 2024

Aerial robots have garnered significant attention due to their potential applications in various industries, such as inspection, search and rescue, and drone delivery. However, the ability of these robots to effectively grasp and land on objects or surfaces is often crucial for the successful completion of missions. This paper presents a novel modular soft gripper design tailored explicitly for aerial grasping and landing operations. The proposed modular pneumatic soft gripper incorporates a feed-forward proportional controller to regulate pressure, enabling compliant gripping capabilities. The modular connectors of the soft fingers offer two configurations of the 4-finger soft gripper, H-base and X-base, allowing adaptability to different target objects. Furthermore, when deflated, the gripper can function as a soft landing gear, reducing the weight and complexity of aerial manipulation control and enhancing flight efficiency. We demonstrate the efficacy of indoor aerial grasping and achieve a maximum payload of 217 g for the proposed soft aerial vehicle (SAV), with the weight of the soft drone being 808 g.

圖形處理器 · Networking · 圖 · Neural Networks · Learning ·

2023 年 11 月 1 日

Semantic Representation Learning of Scientific Literature based on Adaptive Feature and Graph Neural Network

Hongrui Gao,Yawen Li,Meiyu Liang,Zeli Guan,Zhe Xue

Because most of the scientific literature data is unmarked, it makes semantic representation learning based on unsupervised graph become crucial. At the same time, in order to enrich the features of scientific literature, a learning method of semantic representation of scientific literature based on adaptive features and graph neural network is proposed. By introducing the adaptive feature method, the features of scientific literature are considered globally and locally. The graph attention mechanism is used to sum the features of scientific literature with citation relationship, and give each scientific literature different feature weights, so as to better express the correlation between the features of different scientific literature. In addition, an unsupervised graph neural network semantic representation learning method is proposed. By comparing the mutual information between the positive and negative local semantic representation of scientific literature and the global graph semantic representation in the potential space, the graph neural network can capture the local and global information, thus improving the learning ability of the semantic representation of scientific literature. The experimental results show that the proposed learning method of semantic representation of scientific literature based on adaptive feature and graph neural network is competitive on the basis of scientific literature classification, and has achieved good results.

Learning · Agent · 次最優 · INTERACT · AIM ·

2023 年 10 月 31 日

Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning

Marc Lanctot,John Schultz,Neil Burch,Max Olan Smith,Daniel Hennes,Thomas Anthony,Julien Perolat

from arxiv, 25 pages, 8 figures, Accepted at TMLR October 2023

Progress in fields of machine learning and adversarial planning has benefited significantly from benchmark domains, from checkers and the classic UCI data sets to Go and Diplomacy. In sequential decision-making, agent evaluation has largely been restricted to few interactions against experts, with the aim to reach some desired level of performance (e.g. beating a human professional player). We propose a benchmark for multiagent learning based on repeated play of the simple game Rock, Paper, Scissors along with a population of forty-three tournament entries, some of which are intentionally sub-optimal. We describe metrics to measure the quality of agents based both on average returns and exploitability. We then show that several RL, online learning, and language model approaches can learn good counter-strategies and generalize well, but ultimately lose to the top-performing bots, creating an opportunity for research in multiagent learning.

Performer · MoDELS · 設計 · Continuity · 語言模型化 ·

2023 年 10 月 31 日

ChipNeMo: Domain-Adapted LLMs for Chip Design

Mingjie Liu,Teo Ene,Robert Kirby,Chris Cheng,Nathaniel Pinckney,Rongjian Liang,Jonah Alben,Himyanshu Anand,Sanmitra Banerjee,Ismet Bayraktaroglu,Bonita Bhaskaran,Bryan Catanzaro,Arjun Chaudhuri,Sharon Clay,Bill Dally,Laura Dang,Parikshit Deshpande,Siddhanth Dhodhi,Sameer Halepete,Eric Hill,Jiashang Hu,Sumit Jain,Brucek Khailany,Kishor Kunal,Xiaowei Li,Hao Liu,Stuart Oberman,Sujeet Omar,Sreedhar Pratty,Ambar Sarkar,Zhengjiang Shao,Hanfei Sun,Pratik P Suthar,Varun Tej,Kaizhe Xu,Haoxing Ren

ChipNeMo aims to explore the applications of large language models (LLMs) for industrial chip design. Instead of directly deploying off-the-shelf commercial or open-source LLMs, we instead adopt the following domain adaptation techniques: custom tokenizers, domain-adaptive continued pretraining, supervised fine-tuning (SFT) with domain-specific instructions, and domain-adapted retrieval models. We evaluate these methods on three selected LLM applications for chip design: an engineering assistant chatbot, EDA script generation, and bug summarization and analysis. Our results show that these domain adaptation techniques enable significant LLM performance improvements over general-purpose base models across the three evaluated applications, enabling up to 5x model size reduction with similar or better performance on a range of design tasks. Our findings also indicate that there's still room for improvement between our current results and ideal outcomes. We believe that further investigation of domain-adapted LLM approaches will help close this gap in the future.

Networking · Neural Networks · 推斷 · bulk · 查全率/召回率 ·

2023 年 10 月 31 日

A Low-cost Strategic Monitoring Approach for Scalable and Interpretable Error Detection in Deep Neural Networks

Florian Geissler,Syed Qutub,Michael Paulitsch,Karthik Pattabiraman

We present a highly compact run-time monitoring approach for deep computer vision networks that extracts selected knowledge from only a few (down to merely two) hidden layers, yet can efficiently detect silent data corruption originating from both hardware memory and input faults. Building on the insight that critical faults typically manifest as peak or bulk shifts in the activation distribution of the affected network layers, we use strategically placed quantile markers to make accurate estimates about the anomaly of the current inference as a whole. Importantly, the detector component itself is kept algorithmically transparent to render the categorization of regular and abnormal behavior interpretable to a human. Our technique achieves up to ~96% precision and ~98% recall of detection. Compared to state-of-the-art anomaly detection techniques, this approach requires minimal compute overhead (as little as 0.3% with respect to non-supervised inference time) and contributes to the explainability of the model.

高斯過程回歸 · Processing（編程語言） · MoDELS · 行人重識別 · 多樣性 ·

2023 年 10 月 31 日

Density-based User Representation through Gaussian Process Regression for Multi-interest Personalized Retrieval

Haolun Wu,Ofer Mesh,Masrour Zogh,Fernando Diaz, Xue, Liu,Craig Boutilier,Maryam Karimzadehgan

from arxiv, 16 pages, 5 figures

Accurate modeling of the diverse and dynamic interests of users remains a significant challenge in the design of personalized recommender systems. Existing user modeling methods, like single-point and multi-point representations, have limitations w.r.t. accuracy, diversity, computational cost, and adaptability. To overcome these deficiencies, we introduce density-based user representations (DURs), a novel model that leverages Gaussian process regression for effective multi-interest recommendation and retrieval. Our approach, GPR4DUR, exploits DURs to capture user interest variability without manual tuning, incorporates uncertainty-awareness, and scales well to large numbers of users. Experiments using real-world offline datasets confirm the adaptability and efficiency of GPR4DUR, while online experiments with simulated users demonstrate its ability to address the exploration-exploitation trade-off by effectively utilizing model uncertainty.

控制器 · Extensibility · BASIC · CASE · 層 ·

2023 年 10 月 30 日

Rule-Based Lloyd Algorithm for Multi-Robot Motion Planning and Control with Safety and Convergence Guarantees

Manuel Boldrer,Alvaro Serra-Gomez,Lorenzo Lyons,Javier Alonso-Mora,Laura Ferranti

This paper presents a distributed rule-based Lloyd algorithm (RBL) for multi-robot motion planning and control. The main limitations of the basic Loyd-based algorithm (LB) concern deadlock issues and the failure to address dynamic constraints effectively. Our contribution is twofold. First, we show how RBL is able to provide safety and convergence to the goal region without relying on communication between robots, nor neighbors control inputs, nor synchronization between the robots. We considered both case of holonomic and non-holonomic robots with control inputs saturation. Second, we show that the Lloyd-based algorithm (without rules) can be successfully used as a safety layer for learning-based approaches, leading to non-negligible benefits. We further prove the soundness, reliability, and scalability of RBL through extensive simulations, an updated comparison with the state of the art, and experimental validations on small-scale car-like robots.

MoDELS · 語言模型化 · 知識 (knowledge) · GPT3.5 · 蒙特卡洛樹搜索 ·

2023 年 10 月 30 日

Large Language Models as Commonsense Knowledge for Large-Scale Task Planning

Zirui Zhao,Wee Sun Lee,David Hsu

from arxiv, In Proceedings of NeurIPS 2023

Large-scale task planning is a major challenge. Recent work exploits large language models (LLMs) directly as a policy and shows surprisingly interesting results. This paper shows that LLMs provide a commonsense model of the world in addition to a policy that acts on it. The world model and the policy can be combined in a search algorithm, such as Monte Carlo Tree Search (MCTS), to scale up task planning. In our new LLM-MCTS algorithm, the LLM-induced world model provides a commonsense prior belief for MCTS to achieve effective reasoning; the LLM-induced policy acts as a heuristic to guide the search, vastly improving search efficiency. Experiments show that LLM-MCTS outperforms both MCTS alone and policies induced by LLMs (GPT2 and GPT3.5) by a wide margin, for complex, novel tasks. Further experiments and analyses on multiple tasks -- multiplication, multi-hop travel planning, object rearrangement -- suggest minimum description length (MDL) as a general guiding principle: if the description length of the world model is substantially smaller than that of the policy, using LLM as a world model for model-based planning is likely better than using LLM solely as a policy.

估計/估計量 · 通道 · INFORMS · Performer · 設計 ·

2023 年 10 月 28 日

Improving Channel Estimation Performance for Uplink OTFS Transmissions: Pilot Design based on A Posteriori Cramer-Rao Bound

Mingcheng Nie,Shuangyang Li,Deepak Mishra

Orthogonal time frequency space (OTFS) has been widely acknowledged as a promising wireless technology for challenging transmission scenarios, including high-mobility channels. In this paper, we investigate the pilot design for the multi-user OTFS system based on the a priori statistical channel state information (CSI), where the practical threshold-based estimation scheme is adopted. Specifically, we first derive the a posteriori Cramer-Rao bound (PCRB) based on a priori channel information for each user. According to our derivation, the PCRB only relates to the user's pilot signal-to-noise ratio (SNR) and the range of delay and Doppler shifts under the practical power-delay and power-Doppler profiles. Then, a pilot scheme is proposed to minimize the average PCRB of different users, where a closed-form global optimal pilot power allocation is derived. Our numerical results verify the multi-user PCRB analysis. Also, we demonstrate an around 3 dB improvement in the average normalized-mean-square error (NMSE) by using the proposed pilot design in comparison to the conventional embedded pilot design under the same total pilot power.

控制器 · 機器人 · 回合 · Performer · Less ·

2023 年 10 月 27 日

Mixed Reality Environment and High-Dimensional Continuification Control for Swarm Robotics

Gian Carlo Maffettone,Lorenzo Liguori,Eduardo Palermo,Mario di Bernardo,Maurizio Porfiri

A significant challenge in control theory and technology is to devise agile and less resource-intensive experiments for evaluating the performance and feasibility of control algorithms for the collective coordination of large-scale complex systems. Many new methodologies are based on macroscopic representations of the emerging system behavior, and can be easily validated only through numerical simulations, because of the inherent hurdle of developing full scale experimental platforms. In this paper, we introduce a novel hybrid mixed reality set-up for testing swarm robotics techniques, focusing on the collective motion of robotic swarms. This hybrid apparatus combines both real differential drive robots and virtual agents to create a heterogeneous swarm of tunable size. We validate the methodology by extending to higher dimensions, and investigating experimentally, continuification-based control methods for swarms. Our study demonstrates the versatility and effectiveness of the platform for conducting large-scale swarm robotics experiments. Also, it contributes new theoretical insights into control algorithms exploiting continuification approaches.