青柠在线观看免费高清1,在线看片日中文福利免费,国际精品久久久毛片久久久久久久

from arxiv, This preprint has not undergone peer review (when applicable) or any post-submission improvements or corrections. The Version of Record of this article is published in Statistical Methods & Applications, and is available online at //doi.org/10.1007/s10260-023-00696-z

The past decade has seen an increased interest in human activity recognition based on sensor data. Most often, the sensor data come unannotated, creating the need for fast labelling methods. For assessing the quality of the labelling, an appropriate performance measure has to be chosen. Our main contribution is a novel post-processing method for activity recognition. It improves the accuracy of the classification methods by correcting for unrealistic short activities in the estimate. We also propose a new performance measure, the Locally Time-Shifted Measure (LTS measure), which addresses uncertainty in the times of state changes. The effectiveness of the post-processing method is evaluated, using the novel LTS measure, on the basis of a simulated dataset and a real application on sensor data from football. The simulation study is also used to discuss the choice of the parameters of the post-processing method and the LTS measure.

相關內容

Performer

關注 10

binary · 估計/估計量 · MoDELS · 成對型 · 模型評估 ·

2023 年 6 月 16 日

Multi-Classification using One-versus-One Deep Learning Strategy with Joint Probability Estimates

Anthony Hei-Long Chan,Raymond HonFu Chan,Lingjia Dai

The One-versus-One (OvO) strategy is an approach of multi-classification models which focuses on training binary classifiers between each pair of classes. While the OvO strategy takes advantage of balanced training data, the classification accuracy is usually hindered by the voting mechanism to combine all binary classifiers. In this paper, a novel OvO multi-classification model incorporating a joint probability measure is proposed under the deep learning framework. In the proposed model, a two-stage algorithm is developed to estimate the class probability from the pairwise binary classifiers. Given the binary classifiers, the pairwise probability estimate is calibrated by a distance measure on the separating feature hyperplane. From that, the class probability of the subject is estimated by solving a joint probability-based distance minimization problem. Numerical experiments in different applications show that the proposed model achieves generally higher classification accuracy than other state-of-the-art models.

LIDAR · 估計/估計量 · 點云 · 目標檢測 · Performer ·

2023 年 6 月 15 日

LMD: Light-weight Prediction Quality Estimation for Object Detection in Lidar Point Clouds

Tobias Riedlinger,Marius Schubert,Sarina Penquitt,Jan-Marcel Kezmann,Pascal Colling,Karsten Kahl,Lutz Roese-Koerner,Michael Arnold,Urs Zimmermann,Matthias Rottmann

from arxiv, 19 pages, 11 figures, 11 tables

Object detection on Lidar point cloud data is a promising technology for autonomous driving and robotics which has seen a significant rise in performance and accuracy during recent years. Particularly uncertainty estimation is a crucial component for down-stream tasks and deep neural networks remain error-prone even for predictions with high confidence. Previously proposed methods for quantifying prediction uncertainty tend to alter the training scheme of the detector or rely on prediction sampling which results in vastly increased inference time. In order to address these two issues, we propose LidarMetaDetect (LMD), a light-weight post-processing scheme for prediction quality estimation. Our method can easily be added to any pre-trained Lidar object detector without altering anything about the base model and is purely based on post-processing, therefore, only leading to a negligible computational overhead. Our experiments show a significant increase of statistical reliability in separating true from false predictions. We propose and evaluate an additional application of our method leading to the detection of annotation errors. Explicit samples and a conservative count of annotation error proposals indicates the viability of our method for large-scale datasets like KITTI and nuScenes. On the widely-used nuScenes test dataset, 43 out of the top 100 proposals of our method indicate, in fact, erroneous annotations.

數據預處理 · 事件抽取 · MoDELS · 可辨認的 · 輸出空間 ·

2023 年 6 月 15 日

The Devil is in the Details: On the Pitfalls of Event Extraction Evaluation

Hao Peng,Xiaozhi Wang,Feng Yao,Kaisheng Zeng,Lei Hou,Juanzi Li,Zhiyuan Liu,Weixing Shen

from arxiv, Accepted at Findings of ACL 2023

Event extraction (EE) is a crucial task aiming at extracting events from texts, which includes two subtasks: event detection (ED) and event argument extraction (EAE). In this paper, we check the reliability of EE evaluations and identify three major pitfalls: (1) The data preprocessing discrepancy makes the evaluation results on the same dataset not directly comparable, but the data preprocessing details are not widely noted and specified in papers. (2) The output space discrepancy of different model paradigms makes different-paradigm EE models lack grounds for comparison and also leads to unclear mapping issues between predictions and annotations. (3) The absence of pipeline evaluation of many EAE-only works makes them hard to be directly compared with EE works and may not well reflect the model performance in real-world pipeline scenarios. We demonstrate the significant influence of these pitfalls through comprehensive meta-analyses of recent papers and empirical experiments. To avoid these pitfalls, we suggest a series of remedies, including specifying data preprocessing, standardizing outputs, and providing pipeline evaluation results. To help implement these remedies, we develop a consistent evaluation framework OMNIEVENT, which can be obtained from //github.com/THU-KEG/OmniEvent.

離散化 · 有偏 · 縮放 · 推斷 · Analysis ·

2023 年 6 月 15 日

The role of discretization scales in causal inference with continuous-time treatment

Jinghao Sun,Forrest W. Crawford

There are well-established methods for identifying the causal effect of a time-varying treatment applied at discrete time points. However, in the real world, many treatments are continuous or have a finer time scale than the one used for measurement or analysis. While researchers have investigated the discrepancies between estimates under varying discretization scales using simulations and empirical data, it is still unclear how the choice of discretization scale affects causal inference. To address this gap, we present a framework to understand how discretization scales impact the properties of causal inferences about the effect of a time-varying treatment. We introduce the concept of "identification bias", which is the difference between the causal estimand for a continuous-time treatment and the purported estimand of a discretized version of the treatment. We show that this bias can persist even with an infinite number of longitudinal treatment-outcome trajectories. We specifically examine the identification problem in a class of linear stochastic continuous-time data-generating processes and demonstrate the identification bias of the g-formula in this context. Our findings indicate that discretization bias can significantly impact empirical analysis, especially when there are limited repeated measurements. Therefore, we recommend that researchers carefully consider the choice of discretization scale and perform sensitivity analysis to address this bias. We also propose a simple and heuristic quantitative measure for sensitivity concerning discretization and suggest that researchers report this measure along with point and interval estimates in their work. By doing so, researchers can better understand and address the potential impact of discretization bias on causal inference.

機器人 · INTERACT · 優化器 · 控制器 · 多樣性 ·

2023 年 6 月 14 日

Language to Rewards for Robotic Skill Synthesis

Wenhao Yu,Nimrod Gileadi,Chuyuan Fu,Sean Kirmani,Kuang-Huei Lee,Montse Gonzalez Arenas,Hao-Tien Lewis Chiang,Tom Erez,Leonard Hasenclever,Jan Humplik,Brian Ichter,Ted Xiao,Peng Xu,Andy Zeng,Tingnan Zhang,Nicolas Heess,Dorsa Sadigh,Jie Tan,Yuval Tassa,Fei Xia

from arxiv, //language-to-reward.github.io/

Large language models (LLMs) have demonstrated exciting progress in acquiring diverse new capabilities through in-context learning, ranging from logical reasoning to code-writing. Robotics researchers have also explored using LLMs to advance the capabilities of robotic control. However, since low-level robot actions are hardware-dependent and underrepresented in LLM training corpora, existing efforts in applying LLMs to robotics have largely treated LLMs as semantic planners or relied on human-engineered control primitives to interface with the robot. On the other hand, reward functions are shown to be flexible representations that can be optimized for control policies to achieve diverse tasks, while their semantic richness makes them suitable to be specified by LLMs. In this work, we introduce a new paradigm that harnesses this realization by utilizing LLMs to define reward parameters that can be optimized and accomplish variety of robotic tasks. Using reward as the intermediate interface generated by LLMs, we can effectively bridge the gap between high-level language instructions or corrections to low-level robot actions. Meanwhile, combining this with a real-time optimizer, MuJoCo MPC, empowers an interactive behavior creation experience where users can immediately observe the results and provide feedback to the system. To systematically evaluate the performance of our proposed method, we designed a total of 17 tasks for a simulated quadruped robot and a dexterous manipulator robot. We demonstrate that our proposed method reliably tackles 90% of the designed tasks, while a baseline using primitive skills as the interface with Code-as-policies achieves 50% of the tasks. We further validated our method on a real robot arm where complex manipulation skills such as non-prehensile pushing emerge through our interactive system.

估計/估計量 · 核化 · Performer · 正則化項 · Machine Learning ·

2023 年 6 月 14 日

Kernel Debiased Plug-in Estimation

Brian Cho,Kyra Gan,Ivana Malenica,Yaroslav Mukhin

We consider the problem of estimating a scalar target parameter in the presence of nuisance parameters. Replacing the unknown nuisance parameter with a nonparametric estimator, e.g.,a machine learning (ML) model, is convenient but has shown to be inefficient due to large biases. Modern methods, such as the targeted minimum loss-based estimation (TMLE) and double machine learning (DML), achieve optimal performance under flexible assumptions by harnessing ML estimates while mitigating the plug-in bias. To avoid a sub-optimal bias-variance trade-off, these methods perform a debiasing step of the plug-in pre-estimate. Existing debiasing methods require the influence function of the target parameter as input. However, deriving the IF requires specialized expertise and thus obstructs the adaptation of these methods by practitioners. We propose a novel way to debias plug-in estimators which (i) is efficient, (ii) does not require the IF to be implemented, (iii) is computationally tractable, and therefore can be readily adapted to new estimation problems and automated without analytic derivations by the user. We build on the TMLE framework and update a plug-in estimate with a regularized likelihood maximization step over a nonparametric model constructed with a reproducing kernel Hilbert space (RKHS), producing an efficient plug-in estimate for any regular target parameter. Our method, thus, offers the efficiency of competing debiasing techniques without sacrificing the utility of the plug-in approach.

Cognition · Automator · Attention · 回合 · Processing（編程語言） ·

2023 年 6 月 14 日

From Driver to Supervisor: Comparing Cognitive Load and EEG-based Attention Allocation across Automation Levels

Nikol Figalová,Hans-Joachim Bieg,Julian Elias Reiser,Yuan-Cheng Liu,Martin Baumann,Lewis Chuang,Olga Pollatos

from arxiv, 22 pages, 4 figures

With increasing automation, drivers' role progressively transitions from active operators to passive system supervisors, affecting their behaviour and cognitive processes. This study aims to understand better attention allocation and perceived cognitive load in manual, L2, and L3 driving in a realistic environment. We conducted a test-track experiment with 30 participants. While driving a prototype automated vehicle, participants were exposed to a passive auditory oddball task and their EEG was recorded. We studied the P3a ERP component elicited by novel environmental cues, an index of attention resources used to process the stimuli. The self-reported cognitive load was assessed using the NASA-TLX. Our findings revealed no significant difference in perceived cognitive load between manual and L2 driving, with L3 driving demonstrating a lower self-reported cognitive load. Despite this, P3a mean amplitude was highest during manual driving, indicating greater attention allocation towards processing environmental sounds compared to L2 and L3 driving. We argue that the need to integrate environmental information might be attenuated in L2 and L3 driving. Further empirical evidence is necessary to understand whether the decreased processing of environmental stimuli is due to top-down attention control leading to attention withdrawal or a lack of available resources due to high cognitive load. To the best of our knowledge, this study is the first attempt to use the passive oddball paradigm outside the laboratory. The insights of this study have significant implications for automation safety and user interface design.

穩健性 · 潛在 · MoDELS · 白盒 · 數據集 ·

2023 年 6 月 14 日

On the Robustness of Latent Diffusion Models

Jianping Zhang,Zhuoer Xu,Shiwen Cui,Changhua Meng,Weibin Wu,Michael R. Lyu

Latent diffusion models achieve state-of-the-art performance on a variety of generative tasks, such as image synthesis and image editing. However, the robustness of latent diffusion models is not well studied. Previous works only focus on the adversarial attacks against the encoder or the output image under white-box settings, regardless of the denoising process. Therefore, in this paper, we aim to analyze the robustness of latent diffusion models more thoroughly. We first study the influence of the components inside latent diffusion models on their white-box robustness. In addition to white-box scenarios, we evaluate the black-box robustness of latent diffusion models via transfer attacks, where we consider both prompt-transfer and model-transfer settings and possible defense mechanisms. However, all these explorations need a comprehensive benchmark dataset, which is missing in the literature. Therefore, to facilitate the research of the robustness of latent diffusion models, we propose two automatic dataset construction pipelines for two kinds of image editing models and release the whole dataset. Our code and dataset are available at \url{//github.com/jpzhang1810/LDM-Robustness}.

Continuity · HTTPS · SimPLe · 穩健性 · Processing（編程語言） ·

2023 年 6 月 13 日

Evaluation of Spatial Distortion in Multichannel Audio

Karn N. Watcharasupat,Alexander Lerch

from arxiv, Submitted to IEEE Signal Processing Letters

Despite the recent proliferation of spatial audio technologies, the evaluation of spatial quality continues to rely on subjective listening tests, often requiring expert listeners. Based on the duplex theory of spatial hearing, it is possible to construct a signal model for frequency-independent spatial distortion by accounting for inter-channel time and level differences relative to a reference signal. By using a combination of least-square optimization and heuristics, we propose a signal decomposition method to isolate the spatial error from a processed signal. This allows the computation of simple energy-ratio metrics, providing objective measures of spatial and non-spatial signal qualities, with minimal assumption and no dataset dependency. Experiments demonstrate robustness of the method against common signal degradation as introduced by, e.g., audio compression and music source separation. Implementation is available at //github.com/karnwatcharasupat/spauq.

學成 · RNN · 門控 · INFORMS · Performer ·

2018 年 10 月 25 日

Learning with Interpretable Structure from RNN

Bo-Jian Hou,Zhi-Hua Zhou

In structure learning, the output is generally a structure that is used as supervision information to achieve good performance. Considering the interpretation of deep learning models has raised extended attention these years, it will be beneficial if we can learn an interpretable structure from deep learning models. In this paper, we focus on Recurrent Neural Networks (RNNs) whose inner mechanism is still not clearly understood. We find that Finite State Automaton (FSA) that processes sequential data has more interpretable inner mechanism and can be learned from RNNs as the interpretable structure. We propose two methods to learn FSA from RNN based on two different clustering methods. We first give the graphical illustration of FSA for human beings to follow, which shows the interpretability. From the FSA's point of view, we then analyze how the performance of RNNs are affected by the number of gates, as well as the semantic meaning behind the transition of numerical hidden states. Our results suggest that RNNs with simple gated structure such as Minimal Gated Unit (MGU) is more desirable and the transitions in FSA leading to specific classification result are associated with corresponding words which are understandable by human beings.