
Active learning · Learning · PV-RCNN · Annotation · Performer ·

February 5, 2024

ActiveAnno3D - An Active Learning Framework for Multi-Modal 3D Object Detection

Ahmed Ghita, Bjørk Antoniussen, Walter Zimmer, Ross Greer, Christian Creß, Andreas Møgelmose, Mohan M. Trivedi, Alois C. Knoll

The curation of large-scale datasets is still costly and requires much time and resources. Data is often manually labeled, and the challenge of creating high-quality datasets remains. In this work, we fill the research gap using active learning for multi-modal 3D object detection. We propose ActiveAnno3D, an active learning framework to select data samples for labeling that are of maximum informativeness for training. We explore various continuous training methods and integrate the most efficient method regarding computational demand and detection performance. Furthermore, we perform extensive experiments and ablation studies with BEVFusion and PV-RCNN on the nuScenes and TUM Traffic Intersection dataset. We show that we can achieve almost the same performance with PV-RCNN and the entropy-based query strategy when using only half of the training data (77.25 mAP compared to 83.50 mAP) of the TUM Traffic Intersection dataset. BEVFusion achieved an mAP of 64.31 when using half of the training data and 75.0 mAP when using the complete nuScenes dataset. We integrate our active learning framework into the proAnno labeling tool to enable AI-assisted data selection and labeling and minimize the labeling costs. Finally, we provide code, weights, and visualization results on our website: //active3d-framework.github.io/active3d-framework.
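The entropy-based query strategy mentioned in the abstract can be illustrated with a minimal sketch. This is not the ActiveAnno3D implementation: the function names, the `predictions` layout, and the rule of scoring a sample by the mean entropy of its detections are all assumptions made for illustration, and the toy probabilities are invented.

```python
import math

def entropy(probs):
    """Shannon entropy (in nats) of one class-probability vector."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)

def select_for_labeling(predictions, budget):
    """Return the ids of the `budget` most uncertain unlabeled samples.

    `predictions` maps a sample id to the class-probability vectors of the
    objects detected in that sample (hypothetical layout). A sample's score
    is the mean entropy over its detections; samples without detections
    score 0.
    """
    scores = {}
    for sample_id, detections in predictions.items():
        if detections:
            scores[sample_id] = sum(entropy(p) for p in detections) / len(detections)
        else:
            scores[sample_id] = 0.0
    ranked = sorted(scores, key=lambda sid: scores[sid], reverse=True)
    return ranked[:budget]

# Toy pool of three unlabeled frames (made-up probabilities).
pool = {
    "frame_a": [[0.98, 0.01, 0.01]],                   # confident detection
    "frame_b": [[0.34, 0.33, 0.33], [0.5, 0.5, 0.0]],  # uncertain detections
    "frame_c": [[0.7, 0.2, 0.1]],
}
picked = select_for_labeling(pool, budget=2)
print(picked)
```

In an active learning loop, the selected frames would be sent to annotators, added to the training set, and the detector retrained before the next query round.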

Related content

Active learning

Active learning is a subfield of machine learning (and, more generally, of artificial intelligence); in statistics it is also known as query learning or optimal experimental design. The "learning module" and the "selection strategy" are the two basic and important modules of an active learning algorithm. Active learning is "a method of learning in which students are actively or experientially involved in the learning process and where there are different levels of active learning, depending on student involvement" (Bonwell & Eison 1991). Bonwell & Eison (1991) state that "students do other activities besides passively listening to lectures." In a report for the Association for the Study of Higher Education (ASHE), the authors discuss a variety of methods for promoting active learning. They cite literature indicating that, to learn, students must do more than just listen: they must read, write, discuss, and engage in solving problems. This process involves three learning domains: knowledge, skills, and attitudes (KSA). This taxonomy of learning behaviors can be thought of as "the goals of the learning process." In particular, students must engage in higher-order thinking tasks such as analysis, synthesis, and evaluation.

Graph · GPU · Link prediction · Neural Networks · Networking ·

March 17, 2024

Multi-Relational Graph Neural Network for Out-of-Domain Link Prediction

Asma Sattar,Georgios Deligiorgis,Marco Trincavelli,Davide Bacciu

from arxiv, 8 pages, 3 figures, 3 Tables, conference [accepted in IEEE WCCI 2024]

Dynamic multi-relational graphs are an expressive relational representation for data enclosing entities and relations of different types, and where relationships are allowed to vary in time. Addressing predictive tasks over such data requires the ability to find structure embeddings that capture the diversity of the relationships involved, as well as their dynamic evolution. In this work, we establish a novel class of challenging tasks for dynamic multi-relational graphs involving out-of-domain link prediction, where the relationship being predicted is not available in the input graph. We then introduce a novel Graph Neural Network model, named GOOD, designed specifically to tackle the out-of-domain generalization problem. GOOD introduces a novel design concept for multi-relation embedding aggregation, based on the idea that good representations are such when it is possible to disentangle the mixing proportions of the different relational embeddings that have produced it. We also propose five benchmarks based on two retail domains, where we show that GOOD can effectively generalize predictions out of known relationship types and achieve state-of-the-art results. Most importantly, we provide insights into problems where out-of-domain prediction might be preferred to an in-domain formulation, that is, where the relationship to be predicted has very few positive examples.

INFORMS · Optimizer · Design · Model evaluation · Robustness ·

March 16, 2024

Bayesian Design for Sampling Anomalous Spatio-Temporal Data

Katie Buchhorn,Kerrie Mengersen,Edgar Santos-Fernandez,James McGree

Data collected from arrays of sensors are essential for informed decision-making in various systems. However, the presence of anomalies can compromise the accuracy and reliability of insights drawn from the collected data or information obtained via statistical analysis. This study aims to develop a robust Bayesian optimal experimental design (BOED) framework with anomaly detection methods for high-quality data collection. We introduce a general framework that involves anomaly generation, detection and error scoring when searching for an optimal design. This method is demonstrated using two comprehensive simulated case studies: the first study uses a spatial dataset, and the second uses a spatio-temporal river network dataset. As a baseline approach, we employed a commonly used prediction-based utility function based on minimising errors. Results illustrate the trade-off between predictive accuracy and anomaly detection performance for our method under various design scenarios. An optimal design robust to anomalies ensures the collection and analysis of more trustworthy data, playing a crucial role in understanding the dynamics of complex systems such as the environment, therefore enabling informed decisions in monitoring, management, and response.

Layer · Branch · Image segmentation · Convolution · MoDELS ·

March 15, 2024

Real-Time Image Segmentation via Hybrid Convolutional-Transformer Architecture Search

Hongyuan Yu,Cheng Wan,Mengchen Liu,Dongdong Chen,Bin Xiao,Xiyang Dai

from arxiv, 8 pages, 3 figures, submitted to IROS 2024

Image segmentation is one of the most fundamental problems in computer vision and has drawn a lot of attention due to its vast applications in image understanding and autonomous driving. However, designing effective and efficient segmentation neural architectures is a labor-intensive process that may require many trials by human experts. In this paper, we address the challenge of integrating multi-head self-attention into high-resolution representation CNNs efficiently by leveraging architecture search. Manually replacing convolution layers with multi-head self-attention is non-trivial due to the costly memory overhead of maintaining high resolution. By contrast, we develop a multi-target multi-branch supernet method, which not only fully utilizes the advantages of high-resolution features but also finds the proper location for placing the multi-head self-attention module. Our search algorithm is optimized towards multiple objectives (e.g., latency and mIoU) and is capable of finding architectures on the Pareto frontier with an arbitrary number of branches in a single search. We further present a series of models obtained via the Hybrid Convolutional-Transformer Architecture Search (HyCTAS) method, which searches for the best hybrid combination of light-weight convolution layers and memory-efficient self-attention layers between branches of different resolutions and fuses them to high resolution for both efficiency and effectiveness. Extensive experiments demonstrate that HyCTAS outperforms previous methods on the semantic segmentation task. Code and models are available at //github.com/MarvinYu1995/HyCTAS.
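To make the notion of a Pareto frontier over latency and mIoU concrete, here is a minimal sketch of non-dominated filtering. It is not the HyCTAS search algorithm: the `Candidate` type, the function names, and the latency and mIoU values are hypothetical and serve only to show what "on the Pareto frontier" means for two objectives.

```python
from typing import NamedTuple

class Candidate(NamedTuple):
    name: str
    latency_ms: float  # lower is better
    miou: float        # higher is better

def dominates(a, b):
    """True if `a` is no worse than `b` on both objectives and strictly better on one."""
    no_worse = a.latency_ms <= b.latency_ms and a.miou >= b.miou
    strictly_better = a.latency_ms < b.latency_ms or a.miou > b.miou
    return no_worse and strictly_better

def pareto_front(candidates):
    """Keep every candidate that no other candidate dominates."""
    return [c for c in candidates if not any(dominates(o, c) for o in candidates)]

# Made-up architectures, not results from the paper.
archs = [
    Candidate("arch_1", 10.0, 70.0),
    Candidate("arch_2", 15.0, 75.0),
    Candidate("arch_3", 16.0, 74.0),  # slower and less accurate than arch_2
    Candidate("arch_4", 30.0, 80.0),
]
front = pareto_front(archs)
print([c.name for c in front])
```

Each architecture left on the front represents a different trade-off: none of them can be made faster without losing accuracy relative to the others.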

Kernelization · Estimation/Estimator · Functional · Performer · Learning ·

March 15, 2024

A Structure-Preserving Kernel Method for Learning Hamiltonian Systems

Jianyu Hu,Juan-Pablo Ortega,Daiying Yin

A structure-preserving kernel ridge regression method is presented that allows the recovery of potentially high-dimensional and nonlinear Hamiltonian functions out of datasets made of noisy observations of Hamiltonian vector fields. The method proposes a closed-form solution that yields excellent numerical performances that surpass other techniques proposed in the literature in this setup. From the methodological point of view, the paper extends kernel regression methods to problems in which loss functions involving linear functions of gradients are required and, in particular, a differential reproducing property and a Representer Theorem are proved in this context. The relation between the structure-preserving kernel estimator and the Gaussian posterior mean estimator is analyzed. A full error analysis is conducted that provides convergence rates using fixed and adaptive regularization parameters. The good performance of the proposed estimator is illustrated with various numerical experiments.
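For orientation, the closed-form solution of standard kernel ridge regression is recalled below. This is the textbook estimator for function-value observations, not the structure-preserving estimator of the paper, which is fitted to noisy observations of the Hamiltonian vector field and therefore involves gradients of the kernel. The symbols $k$, $K$, $\lambda$, $n$, $x_i$, and $y_i$ are notation assumed here for illustration.

```latex
% Standard kernel ridge regression over an RKHS \mathcal{H} with kernel k,
% given data (x_i, y_i), i = 1, ..., n, and regularization parameter \lambda > 0.
\hat{f} = \operatorname*{arg\,min}_{f \in \mathcal{H}}
    \frac{1}{n} \sum_{i=1}^{n} \bigl( f(x_i) - y_i \bigr)^2
    + \lambda \, \lVert f \rVert_{\mathcal{H}}^2 ,
\qquad
\hat{f}(x) = \sum_{i=1}^{n} \alpha_i \, k(x_i, x),
\qquad
\alpha = \bigl( K + \lambda n I \bigr)^{-1} y ,
\quad K_{ij} = k(x_i, x_j).
```

The Representer Theorem is what guarantees that the minimizer is a finite combination of kernel sections; the abstract states that an analogous result is proved for losses involving linear functions of gradients.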

Vocabulary · Object detection · Extensibility · Performance · Annotation ·

March 13, 2024

Toward Open Vocabulary Aerial Object Detection with CLIP-Activated Student-Teacher Learning

Yan Li,Weiwei Guo,Xue Yang,Ning Liao,Dunyun He,Jiaqi Zhou,Wenxian Yu

An increasingly massive number of remote-sensing images spurs the development of extensible object detectors that can detect objects beyond the training categories without the cost of collecting new labeled data. In this paper, we aim to develop an open-vocabulary object detection (OVD) technique for aerial images that scales up the object vocabulary size beyond the training data. Two fundamental challenges hinder open-vocabulary object detection performance: the quality of the class-agnostic region proposals and the quality of the pseudo-labels that can generalize well to novel object categories. To simultaneously generate high-quality proposals and pseudo-labels, we propose CastDet, a CLIP-activated student-teacher open-vocabulary object detection framework. Our end-to-end framework follows the student-teacher self-learning mechanism and employs the RemoteCLIP model as an extra omniscient teacher with rich knowledge. By doing so, our approach boosts not only novel object proposals but also classification. Furthermore, we devise a dynamic label queue strategy to maintain high-quality pseudo-labels during batch training. We conduct extensive experiments on multiple existing aerial object detection datasets, which are set up for the OVD task. Experimental results demonstrate that CastDet achieves superior open-vocabulary detection performance, e.g., reaching 40.5% mAP, which outperforms the previous methods Detic/ViLD by 23.7%/14.9% on the VisDroneZSD dataset. To the best of our knowledge, this is the first work to apply and develop the open-vocabulary object detection technique for aerial images.

INFORMS · Robustness · Wyner-Ziv · CASE · Learning ·

March 13, 2024

Robust Distributed Compression with Learned Heegard-Berger Scheme

Eyyup Tasci,Ezgi Ozyilkan,Oguzhan Kubilay Ulger,Elza Erkip

We consider lossy compression of an information source when decoder-only side information may be absent. This setup, also referred to as the Heegard-Berger or Kaspi problem, is a special case of robust distributed source coding. Building upon previous works on neural network-based distributed compressors developed for the decoder-only side information (Wyner-Ziv) case, we propose learning-based schemes that are amenable to the availability of side information. We find that our learned compressors mimic the achievability part of the Heegard-Berger theorem and yield interpretable results operating close to information-theoretic bounds. Depending on the availability of the side information, our neural compressors recover characteristics of the point-to-point (i.e., with no side information) and the Wyner-Ziv coding strategies that include binning in the source space, although no structure exploiting knowledge of the source and side information was imposed into the design.

Large language model · Performance · MoDELS · Language modeling · Boosting (a way of speeding up model training) ·

March 13, 2024

Boosting Disfluency Detection with Large Language Model as Disfluency Generator

Zhenrong Cheng,Jiayan Guo,Hao Sun,Yan Zhang

Current disfluency detection methods heavily rely on costly and scarce human-annotated data. To tackle this issue, some approaches employ heuristic or statistical features to generate disfluent sentences, partially improving detection performance. However, these sentences often deviate from real-life scenarios, constraining overall model enhancement. In this study, we propose a lightweight data augmentation approach for disfluency detection, utilizing the superior generative and semantic understanding capabilities of large language model (LLM) to generate disfluent sentences as augmentation data. We leverage LLM to generate diverse and more realistic sentences guided by specific prompts, without the need for fine-tuning the LLM. Subsequently, we apply an uncertainty-aware data filtering approach to improve the quality of the generated sentences, utilized in training a small detection model for improved performance. Experiments using enhanced data yielded state-of-the-art results. The results showed that using a small amount of LLM-generated enhanced data can significantly improve performance, thereby further enhancing cost-effectiveness.

Extensibility · Learned · Noise distribution · Networking · Representation learning ·

July 25, 2021

Image Manipulation Detection by Multi-View Multi-Scale Supervision

Xinru Chen,Chengbo Dong,Jiaqi Ji,Juan Cao,Xirong Li

from arxiv, Accepted by ICCV 2021

The key challenge of image manipulation detection is how to learn generalizable features that are sensitive to manipulations in novel data, whilst specific to prevent false alarms on authentic images. Current research emphasizes the sensitivity, with the specificity overlooked. In this paper we address both aspects by multi-view feature learning and multi-scale supervision. By exploiting noise distribution and boundary artifact surrounding tampered regions, the former aims to learn semantic-agnostic and thus more generalizable features. The latter allows us to learn from authentic images which are nontrivial to be taken into account by current semantic segmentation network based methods. Our thoughts are realized by a new network which we term MVSS-Net. Extensive experiments on five benchmark sets justify the viability of MVSS-Net for both pixel-level and image-level manipulation detection.

GPU · MoDELS · Networking · Neural Networks · Graph ·

June 9, 2021

Cross-Node Federated Graph Neural Network for Spatio-Temporal Data Modeling

Chuizheng Meng,Sirisha Rambhatla,Yan Liu

from arxiv, To be published in the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 21)

Vast amount of data generated from networks of sensors, wearables, and the Internet of Things (IoT) devices underscores the need for advanced modeling techniques that leverage the spatio-temporal structure of decentralized data due to the need for edge computation and licensing (data access) issues. While federated learning (FL) has emerged as a framework for model training without requiring direct data sharing and exchange, effectively modeling the complex spatio-temporal dependencies to improve forecasting capabilities still remains an open problem. On the other hand, state-of-the-art spatio-temporal forecasting models assume unfettered access to the data, neglecting constraints on data sharing. To bridge this gap, we propose a federated spatio-temporal model -- Cross-Node Federated Graph Neural Network (CNFGNN) -- which explicitly encodes the underlying graph structure using graph neural network (GNN)-based architecture under the constraint of cross-node federated learning, which requires that data in a network of nodes is generated locally on each node and remains decentralized. CNFGNN operates by disentangling the temporal dynamics modeling on devices and spatial dynamics on the server, utilizing alternating optimization to reduce the communication cost, facilitating computations on the edge devices. Experiments on the traffic flow forecasting task show that CNFGNN achieves the best forecasting performance in both transductive and inductive learning settings with no extra computation cost on edge devices, while incurring modest communication cost.

Graph · Representation learning · Knowledge graph · INTERACT · Performer ·

January 23, 2019

Multi-Task Feature Learning for Knowledge Graph Enhanced Recommendation

Hongwei Wang,Fuzheng Zhang,Miao Zhao,Wenjie Li,Xing Xie,Minyi Guo

from arxiv, In Proceedings of The 2019 Web Conference (WWW 2019)

Collaborative filtering often suffers from sparsity and cold start problems in real recommendation scenarios, therefore, researchers and engineers usually use side information to address the issues and improve the performance of recommender systems. In this paper, we consider knowledge graphs as the source of side information. We propose MKR, a Multi-task feature learning approach for Knowledge graph enhanced Recommendation. MKR is a deep end-to-end framework that utilizes knowledge graph embedding task to assist recommendation task. The two tasks are associated by cross&compress units, which automatically share latent features and learn high-order interactions between items in recommender systems and entities in the knowledge graph. We prove that cross&compress units have sufficient capability of polynomial approximation, and show that MKR is a generalized framework over several representative methods of recommender systems and multi-task learning. Through extensive experiments on real-world datasets, we demonstrate that MKR achieves substantial gains in movie, book, music, and news recommendation, over state-of-the-art baselines. MKR is also shown to be able to maintain a decent performance even if user-item interactions are sparse.


