亚洲男人的天堂2018av,欧美草比,久久久久久免费视频精选,国色天香在线看免费,久久久久亚洲av成人片仓井空

·

Networking · 目標檢測 · Performance · 示例 · 可約的 ·

2024 年 2 月 15 日

Label-Efficient Object Detection via Region Proposal Network Pre-Training

Nanqing Dong,Linus Ericsson,Yongxin Yang,Ales Leonardis,Steven McDonagh

from arxiv, Accepted by Neurocomputing

Self-supervised pre-training, based on the pretext task of instance discrimination, has fueled the recent advance in label-efficient object detection. However, existing studies focus on pre-training only a feature extractor network to learn transferable representations for downstream detection tasks. This leads to the necessity of training multiple detection-specific modules from scratch in the fine-tuning phase. We argue that the region proposal network (RPN), a common detection-specific module, can additionally be pre-trained towards reducing the localization error of multi-stage detectors. In this work, we propose a simple pretext task that provides an effective pre-training for the RPN, towards efficiently improving downstream object detection performance. We evaluate the efficacy of our approach on benchmark object detection tasks and additional downstream tasks, including instance segmentation and few-shot detection. In comparison with multi-stage detectors without RPN pre-training, our approach is able to consistently improve downstream task performance, with largest gains found in label-scarce settings.

相關內容

Networking

Networking：IFIP International Conferences on Networking。 Explanation：國際網絡會議。 Publisher：IFIP。 SIT：

Boosting（一種模型訓練加速方式） · Performer · 目標檢測 · 不變 · Learning ·

2024 年 3 月 27 日

Boosting Object Detection with Zero-Shot Day-Night Domain Adaptation

Zhipeng Du,Miaojing Shi,Jiankang Deng

from arxiv, Accepted to CVPR 2024

Detecting objects in low-light scenarios presents a persistent challenge, as detectors trained on well-lit data exhibit significant performance degradation on low-light data due to low visibility. Previous methods mitigate this issue by exploring image enhancement or object detection techniques with real low-light image datasets. However, the progress is impeded by the inherent difficulties about collecting and annotating low-light images. To address this challenge, we propose to boost low-light object detection with zero-shot day-night domain adaptation, which aims to generalize a detector from well-lit scenarios to low-light ones without requiring real low-light data. Revisiting Retinex theory in the low-level vision, we first design a reflectance representation learning module to learn Retinex-based illumination invariance in images with a carefully designed illumination invariance reinforcement strategy. Next, an interchange-redecomposition-coherence procedure is introduced to improve over the vanilla Retinex image decomposition process by performing two sequential image decompositions and introducing a redecomposition cohering loss. Extensive experiments on ExDark, DARK FACE, and CODaN datasets show strong low-light generalizability of our method. Our code is available at //github.com/ZPDu/DAI-Net.

MoDELS · Learning · Performer · 聯邦學習 · 情景 ·

2024 年 3 月 27 日

Stragglers-Aware Low-Latency Synchronous Federated Learning via Layer-Wise Model Updates

Natalie Lang,Alejandro Cohen,Nir Shlezinger

Synchronous federated learning (FL) is a popular paradigm for collaborative edge learning. It typically involves a set of heterogeneous devices locally training neural network (NN) models in parallel with periodic centralized aggregations. As some of the devices may have limited computational resources and varying availability, FL latency is highly sensitive to stragglers. Conventional approaches discard incomplete intra-model updates done by stragglers, alter the amount of local workload and architecture, or resort to asynchronous settings; which all affect the trained model performance under tight training latency constraints. In this work, we propose straggler-aware layer-wise federated learning (SALF) that leverages the optimization procedure of NNs via backpropagation to update the global model in a layer-wise fashion. SALF allows stragglers to synchronously convey partial gradients, having each layer of the global model be updated independently with a different contributing set of users. We provide a theoretical analysis, establishing convergence guarantees for the global model under mild assumptions on the distribution of the participating devices, revealing that SALF converges at the same asymptotic rate as FL with no timing limitations. This insight is matched with empirical observations, demonstrating the performance gains of SALF compared to alternative mechanisms mitigating the device heterogeneity gap in FL.

Automator · Agent · CRAFT · Integration · HTTPS ·

2024 年 3 月 26 日

Scenario-Based Curriculum Generation for Multi-Agent Autonomous Driving

Axel Brunnbauer,Luigi Berducci,Peter Priller,Dejan Nickovic,Radu Grosu

from arxiv, 7 Pages, Under Review

The automated generation of diverse and complex training scenarios has been an important ingredient in many complex learning tasks. Especially in real-world application domains, such as autonomous driving, auto-curriculum generation is considered vital for obtaining robust and general policies. However, crafting traffic scenarios with multiple, heterogeneous agents is typically considered as a tedious and time-consuming task, especially in more complex simulation environments. In our work, we introduce MATS-Gym, a Multi-Agent Traffic Scenario framework to train agents in CARLA, a high-fidelity driving simulator. MATS-Gym is a multi-agent training framework for autonomous driving that uses partial scenario specifications to generate traffic scenarios with variable numbers of agents. This paper unifies various existing approaches to traffic scenario description into a single training framework and demonstrates how it can be integrated with techniques from unsupervised environment design to automate the generation of adaptive auto-curricula. The code is available at //github.com/AutonomousDrivingExaminer/mats-gym.

3D · Vision · 塑造 · PULSE · 傳感器 ·

2024 年 3 月 26 日

Towards 3D Vision with Low-Cost Single-Photon Cameras

Fangzhou Mu,Carter Sifferman,Sacha Jungerman,Yiquan Li,Mark Han,Michael Gleicher,Mohit Gupta,Yin Li

We present a method for reconstructing 3D shape of arbitrary Lambertian objects based on measurements by miniature, energy-efficient, low-cost single-photon cameras. These cameras, operating as time resolved image sensors, illuminate the scene with a very fast pulse of diffuse light and record the shape of that pulse as it returns back from the scene at a high temporal resolution. We propose to model this image formation process, account for its non-idealities, and adapt neural rendering to reconstruct 3D geometry from a set of spatially distributed sensors with known poses. We show that our approach can successfully recover complex 3D shapes from simulated data. We further demonstrate 3D object reconstruction from real-world captures, utilizing measurements from a commodity proximity sensor. Our work draws a connection between image-based modeling and active range scanning and is a step towards 3D vision with single-photon cameras.

泛函 · Performer · Learning · 代價 · 極大 ·

2024 年 3 月 25 日

Decision-Oriented Learning Using Differentiable Submodular Maximization for Multi-Robot Coordination

Guangyao Shi,Chak Lam Shek,Nare Karapetyan,Pratap Tokekar

from arxiv, arXiv admin note: text overlap with arXiv:2303.01543

We present a differentiable, decision-oriented learning framework for cost prediction in a class of multi-robot decision-making problems, in which the robots need to trade off the task performance with the costs of taking actions when they select actions to take. Specifically, we consider the cases where the task performance is measured by a known monotone submodular function (e.g., coverage, mutual information), and the cost of actions depends on the context (e.g., wind and terrain conditions). We need to learn a function that maps the context to the costs. Classically, we treat such a learning problem and the downstream decision-making problem as two decoupled problems, i.e., we first learn to predict the cost function without considering the downstream decision-making problem, and then use the learned function for predicting the cost and using it in the decision-making problem. However, the loss function used in learning a prediction function may not be aligned with the downstream decision-making. We propose a decision-oriented learning framework that incorporates the downstream task performance in the prediction phase via a differentiable optimization layer. The main computational challenge in such a framework is to make the combinatorial optimization, i.e., non-monotone submodular maximization, differentiable. This function is not naturally differentiable. We propose the Differentiable Cost Scaled Greedy algorithm (D-CSG), which is a continuous and differentiable relaxation of CSG. We demonstrate the efficacy of the proposed framework through numerical simulations. The results show that the proposed framework can result in better performance than the traditional two-stage approach.

可辨認的 · 跡 · 回合 · 表示 · Learning ·

2024 年 3 月 22 日

Identifying Linearly-Mixed Causal Representations from Multi-Node Interventions

Simon Bing,Urmi Ninad,Jonas Wahl,Jakob Runge

from arxiv, Accepted for publication at CLeaR 2024

The task of inferring high-level causal variables from low-level observations, commonly referred to as causal representation learning, is fundamentally underconstrained. As such, recent works to address this problem focus on various assumptions that lead to identifiability of the underlying latent causal variables. A large corpus of these preceding approaches consider multi-environment data collected under different interventions on the causal model. What is common to virtually all of these works is the restrictive assumption that in each environment, only a single variable is intervened on. In this work, we relax this assumption and provide the first identifiability result for causal representation learning that allows for multiple variables to be targeted by an intervention within one environment. Our approach hinges on a general assumption on the coverage and diversity of interventions across environments, which also includes the shared assumption of single-node interventions of previous works. The main idea behind our approach is to exploit the trace that interventions leave on the variance of the ground truth causal variables and regularizing for a specific notion of sparsity with respect to this trace. In addition to and inspired by our theoretical contributions, we present a practical algorithm to learn causal representations from multi-node interventional data and provide empirical evidence that validates our identifiability results.

Conformer · 覆蓋 · 穩健性 · 置信度 · 測試數據 ·

2024 年 3 月 22 日

Robust Conformal Prediction under Distribution Shift via Physics-Informed Structural Causal Model

Rui Xu,Yue Sun,Chao Chen,Parv Venkitasubramaniam,Sihong Xie

Uncertainty is critical to reliable decision-making with machine learning. Conformal prediction (CP) handles uncertainty by predicting a set on a test input, hoping the set to cover the true label with at least $(1-\alpha)$ confidence. This coverage can be guaranteed on test data even if the marginal distributions $P_X$ differ between calibration and test datasets. However, as it is common in practice, when the conditional distribution $P_{Y|X}$ is different on calibration and test data, the coverage is not guaranteed and it is essential to measure and minimize the coverage loss under distributional shift at \textit{all} possible confidence levels. To address these issues, we upper bound the coverage difference at all levels using the cumulative density functions of calibration and test conformal scores and Wasserstein distance. Inspired by the invariance of physics across data distributions, we propose a physics-informed structural causal model (PI-SCM) to reduce the upper bound. We validated that PI-SCM can improve coverage robustness along confidence level and test domain on a traffic speed prediction task and an epidemic spread task with multiple real-world datasets.

方陣 · 代碼 · 散度 · CASE · 泛函 ·

2024 年 3 月 21 日

Output-Constrained Lossy Source Coding With Application to Rate-Distortion-Perception Theory

Li Xie,Liangyan Li,Jun Chen,Zhongshan Zhang

The distortion-rate function of output-constrained lossy source coding with limited common randomness is analyzed for the special case of squared error distortion measure. An explicit expression is obtained when both source and reconstruction distributions are Gaussian. This further leads to a partial characterization of the information-theoretic limit of quadratic Gaussian rate-distortion-perception coding with the perception measure given by Kullback-Leibler divergence or squared quadratic Wasserstein distance.

contrastive · 學成 · 對比學習 · 目標檢測 · 優化器 ·

2021 年 4 月 4 日

Dense Contrastive Learning for Self-Supervised Visual Pre-Training

Xinlong Wang,Rufeng Zhang,Chunhua Shen,Tao Kong,Lei Li

from arxiv, 11 pages. Accepted to IEEE/CVF Conf. Comp. Vision Pattern Recognition (CVPR) 2021; Oral paper

To date, most existing self-supervised learning methods are designed and optimized for image classification. These pre-trained models can be sub-optimal for dense prediction tasks due to the discrepancy between image-level prediction and pixel-level prediction. To fill this gap, we aim to design an effective, dense self-supervised learning method that directly works at the level of pixels (or local features) by taking into account the correspondence between local features. We present dense contrastive learning, which implements self-supervised learning by optimizing a pairwise contrastive (dis)similarity loss at the pixel level between two views of input images. Compared to the baseline method MoCo-v2, our method introduces negligible computation overhead (only <1% slower), but demonstrates consistently superior performance when transferring to downstream dense prediction tasks including object detection, semantic segmentation and instance segmentation; and outperforms the state-of-the-art methods by a large margin. Specifically, over the strong MoCo-v2 baseline, our method achieves significant improvements of 2.0% AP on PASCAL VOC object detection, 1.1% AP on COCO object detection, 0.9% AP on COCO instance segmentation, 3.0% mIoU on PASCAL VOC semantic segmentation and 1.8% mIoU on Cityscapes semantic segmentation. Code is available at: //git.io/AdelaiDet

小樣本學習 · 目標檢測 · Networking · 數據集 · 情景 ·

2020 年 3 月 31 日

Few-Shot Object Detection with Attention-RPN and Multi-Relation Detector

Qi Fan,Wei Zhuo,Chi-Keung Tang,Yu-Wing Tai

from arxiv, CVPR2020 Camera Ready. (Fix Figure 3 and Table 5. More implementation details in the supplementary material.)

Conventional methods for object detection typically require a substantial amount of training data and preparing such high-quality training data is very labor-intensive. In this paper, we propose a novel few-shot object detection network that aims at detecting objects of unseen categories with only a few annotated examples. Central to our method are our Attention-RPN, Multi-Relation Detector and Contrastive Training strategy, which exploit the similarity between the few shot support set and query set to detect novel objects while suppressing false detection in the background. To train our network, we contribute a new dataset that contains 1000 categories of various objects with high-quality annotations. To the best of our knowledge, this is one of the first datasets specifically designed for few-shot object detection. Once our few-shot network is trained, it can detect objects of unseen categories without further training or fine-tuning. Our method is general and has a wide range of potential applications. We produce a new state-of-the-art performance on different datasets in the few-shot setting. The dataset link is //github.com/fanq15/Few-Shot-Object-Detection-Dataset.

閱讀: 0 點贊: 0

小貼士

登錄享

相關主題

北京阿比特科技有限公司

注冊地址：北京市海淀區羊坊店路18號2幢3層301-191

<dir id='lggsj'><del id='lggsj'><del id='lggsj'></del><pre id='lggsj'><pre id='lggsj'><option id='lggsj'><address id='lggsj'></address><bdo id='lggsj'><tr id='lggsj'><acronym id='lggsj'><pre id='lggsj'></pre></acronym><div id='lggsj'></div></tr></bdo></option></pre><small id='lggsj'><address id='lggsj'><u id='lggsj'><legend id='lggsj'><option id='lggsj'><abbr id='lggsj'></abbr><li id='lggsj'><pre id='lggsj'></pre></li></option></legend><select id='lggsj'></select></u></address></small></pre></del><sup id='lggsj'></sup><blockquote id='lggsj'><dt id='lggsj'></dt></blockquote><blockquote id='lggsj'></blockquote></dir><tt id='lggsj'></tt><u id='lggsj'><tt id='lggsj'><form id='lggsj'></form></tt><td id='lggsj'><dt id='lggsj'></dt></td></u>

<code id='lggsj'><i id='lggsj'><q id='lggsj'><legend id='lggsj'><pre id='lggsj'><style id='lggsj'><acronym id='lggsj'><i id='lggsj'><form id='lggsj'><option id='lggsj'><center id='lggsj'></center></option></form></i></acronym></style><tt id='lggsj'></tt></pre></legend></q></i></code><center id='lggsj'></center>

<dd id='lggsj'></dd>

<style id='lggsj'></style><sub id='lggsj'><dfn id='lggsj'><abbr id='lggsj'><big id='lggsj'><bdo id='lggsj'></bdo></big></abbr></dfn></sub>_{<dir id='lggsj'></dir>}