干逼视频无码免费网站_久久精品久久精品亚洲老牛影院_一级簧片网址免费_免费观看裸体美女网站_国产99偷激情在线视频_亚洲国产精品综合久久_永久免费华人在线视频网

Aging civil infrastructures are closely monitored by engineers for damage and critical defects. As the manual inspection of such large structures is costly and time-consuming, we are working towards fully automating the visual inspections to support the prioritization of maintenance activities. To that end we combine recent advances in drone technology and deep learning. Unfortunately, annotation costs are incredibly high as our proprietary civil engineering dataset must be annotated by highly trained engineers. Active learning is, therefore, a valuable tool to optimize the trade-off between model performance and annotation costs. Our use-case differs from the classical active learning setting as our dataset suffers from heavy class imbalance and consists of a much larger already labeled data pool than other active learning research. We present a novel method capable of operating in this challenging setting by replacing the traditional active learning acquisition function with an auxiliary binary discriminator. We experimentally show that our novel method outperforms the best-performing traditional active learning method (BALD) by 5% and 38% accuracy on CIFAR-10 and our proprietary dataset respectively.

相關內容

主動學習(xi)

關注 240

主(zhu)動(dong)(dong)學(xue)(xue)(xue)(xue)(xue)習(xi)(xi)(xi)(xi)是(shi)機(ji)器學(xue)(xue)(xue)(xue)(xue)習(xi)(xi)(xi)(xi)（更普遍的(de)(de)(de)說(shuo)是(shi)人工智能）的(de)(de)(de)一(yi)個(ge)(ge)子領(ling)域，在(zai)統計(ji)學(xue)(xue)(xue)(xue)(xue)領(ling)域也叫查(cha)詢學(xue)(xue)(xue)(xue)(xue)習(xi)(xi)(xi)(xi)、最優實驗(yan)(yan)設計(ji)。“學(xue)(xue)(xue)(xue)(xue)習(xi)(xi)(xi)(xi)模塊”和(he)“選擇策(ce)略”是(shi)主(zhu)動(dong)(dong)學(xue)(xue)(xue)(xue)(xue)習(xi)(xi)(xi)(xi)算(suan)法的(de)(de)(de)2個(ge)(ge)基(ji)本且重要(yao)的(de)(de)(de)模塊。主(zhu)動(dong)(dong)學(xue)(xue)(xue)(xue)(xue)習(xi)(xi)(xi)(xi)是(shi)“一(yi)種學(xue)(xue)(xue)(xue)(xue)習(xi)(xi)(xi)(xi)方(fang)法，在(zai)這(zhe)種方(fang)法中，學(xue)(xue)(xue)(xue)(xue)生(sheng)會主(zhu)動(dong)(dong)或體驗(yan)(yan)性地參(can)與(yu)學(xue)(xue)(xue)(xue)(xue)習(xi)(xi)(xi)(xi)過(guo)程(cheng)，并且根據學(xue)(xue)(xue)(xue)(xue)生(sheng)的(de)(de)(de)參(can)與(yu)程(cheng)度(du)，有不(bu)同程(cheng)度(du)的(de)(de)(de)主(zhu)動(dong)(dong)學(xue)(xue)(xue)(xue)(xue)習(xi)(xi)(xi)(xi)。” （Bonwell＆Eison 1991）Bonwell＆Eison（1991）指(zhi)出：“學(xue)(xue)(xue)(xue)(xue)生(sheng)除了(le)被動(dong)(dong)地聽(ting)課(ke)以外(wai)，還(huan)從事(shi)其他活動(dong)(dong)。” 在(zai)高等教育研究協會（ASHE）的(de)(de)(de)一(yi)份報告(gao)中，作(zuo)者(zhe)討論了(le)各種促(cu)進主(zhu)動(dong)(dong)學(xue)(xue)(xue)(xue)(xue)習(xi)(xi)(xi)(xi)的(de)(de)(de)方(fang)法。他們(men)引用了(le)一(yi)些(xie)文獻(xian)，這(zhe)些(xie)文獻(xian)表明學(xue)(xue)(xue)(xue)(xue)生(sheng)不(bu)僅(jin)要(yao)做(zuo)聽(ting)，還(huan)必(bi)須(xu)(xu)做(zuo)更多的(de)(de)(de)事(shi)情(qing)才能學(xue)(xue)(xue)(xue)(xue)習(xi)(xi)(xi)(xi)。他們(men)必(bi)須(xu)(xu)閱讀，寫作(zuo)，討論并參(can)與(yu)解決(jue)問(wen)題。此過(guo)程(cheng)涉及三(san)個(ge)(ge)學(xue)(xue)(xue)(xue)(xue)習(xi)(xi)(xi)(xi)領(ling)域，即(ji)知識，技能和(he)態度(du)（KSA）。這(zhe)種學(xue)(xue)(xue)(xue)(xue)習(xi)(xi)(xi)(xi)行為(wei)(wei)分類法可以被認為(wei)(wei)是(shi)“學(xue)(xue)(xue)(xue)(xue)習(xi)(xi)(xi)(xi)過(guo)程(cheng)的(de)(de)(de)目標”。特別是(shi)，學(xue)(xue)(xue)(xue)(xue)生(sheng)必(bi)須(xu)(xu)從事(shi)諸如分析(xi)，綜合和(he)評估(gu)之類的(de)(de)(de)高級(ji)思維(wei)任務。

標注 · 簇 · 狀態空間 · Learning · 秩 ·

2022 年 11 月 30 日

Time-Efficient Reward Learning via Visually Assisted Cluster Ranking

David Zhang,Micah Carroll,Andreea Bobu,Anca Dragan

from arxiv, Presented at the NeurIPS 2022 Human in the Loop Learning (HiLL) Workshop

One of the most successful paradigms for reward learning uses human feedback in the form of comparisons. Although these methods hold promise, human comparison labeling is expensive and time consuming, constituting a major bottleneck to their broader applicability. Our insight is that we can greatly improve how effectively human time is used in these approaches by batching comparisons together, rather than having the human label each comparison individually. To do so, we leverage data dimensionality-reduction and visualization techniques to provide the human with a interactive GUI displaying the state space, in which the user can label subportions of the state space. Across some simple Mujoco tasks, we show that this high-level approach holds promise and is able to greatly increase the performance of the resulting agents, provided the same amount of human labeling time.

Learning · 偽標記 · 標注 · 示例 · 正則化項 ·

2022 年 11 月 30 日

Learning with Partial Labels from Semi-supervised Perspective

Ximing Li,Yuanzhi Jiang,Changchun Li,Yiyuan Wang,Jihong Ouyang

Partial Label (PL) learning refers to the task of learning from the partially labeled data, where each training instance is ambiguously equipped with a set of candidate labels but only one is valid. Advances in the recent deep PL learning literature have shown that the deep learning paradigms, e.g., self-training, contrastive learning, or class activate values, can achieve promising performance. Inspired by the impressive success of deep Semi-Supervised (SS) learning, we transform the PL learning problem into the SS learning problem, and propose a novel PL learning method, namely Partial Label learning with Semi-supervised Perspective (PLSP). Specifically, we first form the pseudo-labeled dataset by selecting a small number of reliable pseudo-labeled instances with high-confidence prediction scores and treating the remaining instances as pseudo-unlabeled ones. Then we design a SS learning objective, consisting of a supervised loss for pseudo-labeled instances and a semantic consistency regularization for pseudo-unlabeled instances. We further introduce a complementary regularization for those non-candidate labels to constrain the model predictions on them to be as small as possible. Empirical results demonstrate that PLSP significantly outperforms the existing PL baseline methods, especially on high ambiguity levels. Code available: //github.com/changchunli/PLSP.

控制器 · Learning · 優化器 · 強化學習 · 部分可觀測馬爾可夫決策過程 ·

2022 年 11 月 29 日

Reinforcement Learning Methods for Wordle: A POMDP/Adaptive Control Approach

Siddhant Bhambri,Amrita Bhattacharjee,Dimitri Bertsekas

In this paper we address the solution of the popular Wordle puzzle, using new reinforcement learning methods, which apply more generally to adaptive control of dynamic systems and to classes of Partially Observable Markov Decision Process (POMDP) problems. These methods are based on approximation in value space and the rollout approach, admit a straightforward implementation, and provide improved performance over various heuristic approaches. For the Wordle puzzle, they yield on-line solution strategies that are very close to optimal at relatively modest computational cost. Our methods are viable for more complex versions of Wordle and related search problems, for which an optimal strategy would be impossible to compute. They are also applicable to a wide range of adaptive sequential decision problems that involve an unknown or frequently changing environment whose parameters are estimated on-line.

可交換的 · INTERACT · INFORMS · Performer · Analysis ·

2022 年 11 月 28 日

Efficient Update of Redundancy Matrices for Truss and Frame Structures

Tim Krake,Malte von Scheven,Jan Gade,Moataz Abdelaal,Daniel Weiskopf,Manfred Bischoff

Redundancy matrices provide insights into the load carrying behavior of statically indeterminate structures. This information can be employed for the design and analysis of structures with regard to certain objectives, for example reliability, robustness, or adaptability. In this context, the structure is often iteratively examined with the help of slight adjustments. However, this procedure generally requires a high computational effort for the recalculation of the redundancy matrix due to the necessity of costly matrix operations. This paper addresses this problem by providing generic algebraic formulations for efficiently updating the redundancy matrix (and related matrices). The formulations include various modifications like adding, removing, and exchanging elements and are applicable to truss and frame structures. With several examples, we demonstrate the interaction between the formulas and their mechanical interpretation. Finally, a performance test for a scaleable structure is presented.

2022 年 9 月 21 日

Deep Learning for Medical Image Segmentation: Tricks, Challenges and Future Directions

Dong Zhang,Yi Lin,Hao Chen,Zhuotao Tian,Xin Yang,Jinhui Tang,Kwang Ting Cheng

from arxiv, Under consideration

Over the past few years, the rapid development of deep learning technologies for computer vision has greatly promoted the performance of medical image segmentation (MedISeg). However, the recent MedISeg publications usually focus on presentations of the major contributions (e.g., network architectures, training strategies, and loss functions) while unwittingly ignoring some marginal implementation details (also known as "tricks"), leading to a potential problem of the unfair experimental result comparisons. In this paper, we collect a series of MedISeg tricks for different model implementation phases (i.e., pre-training model, data pre-processing, data augmentation, model implementation, model inference, and result post-processing), and experimentally explore the effectiveness of these tricks on the consistent baseline models. Compared to paper-driven surveys that only blandly focus on the advantages and limitation analyses of segmentation models, our work provides a large number of solid experiments and is more technically operable. With the extensive experimental results on both the representative 2D and 3D medical image datasets, we explicitly clarify the effect of these tricks. Moreover, based on the surveyed tricks, we also open-sourced a strong MedISeg repository, where each of its components has the advantage of plug-and-play. We believe that this milestone work not only completes a comprehensive and complementary survey of the state-of-the-art MedISeg approaches, but also offers a practical guide for addressing the future medical image processing challenges including but not limited to small dataset learning, class imbalance learning, multi-modality learning, and domain adaptation. The code has been released at: //github.com/hust-linyi/MedISeg

Processing（編程語言） · 深度強化學習 · 學成 · 強化學習 · INTERACT ·

2022 年 2 月 4 日

A Survey on Deep Reinforcement Learning for Data Processing and Analytics

Qingpeng Cai,Can Cui,Yiyuan Xiong,Wei Wang,Zhongle Xie,Meihui Zhang

from arxiv, 39 pages, 3 figures and 3 tables

Data processing and analytics are fundamental and pervasive. Algorithms play a vital role in data processing and analytics where many algorithm designs have incorporated heuristics and general rules from human knowledge and experience to improve their effectiveness. Recently, reinforcement learning, deep reinforcement learning (DRL) in particular, is increasingly explored and exploited in many areas because it can learn better strategies in complicated environments it is interacting with than statically designed algorithms. Motivated by this trend, we provide a comprehensive review of recent works focusing on utilizing DRL to improve data processing and analytics. First, we present an introduction to key concepts, theories, and methods in DRL. Next, we discuss DRL deployment on database systems, facilitating data processing and analytics in various aspects, including data organization, scheduling, tuning, and indexing. Then, we survey the application of DRL in data processing and analytics, ranging from data preparation, natural language processing to healthcare, fintech, etc. Finally, we discuss important open challenges and future research directions of using DRL in data processing and analytics.

結構化學習 · 圖 · 稀疏 · 圖形處理器 · Neural Networks ·

2021 年 12 月 13 日

Sparse Structure Learning via Graph Neural Networks for Inductive Document Classification

Yinhua Piao,Sangseon Lee,Dohoon Lee,Sun Kim

from arxiv, Accepted by AAAI 2022

Recently, graph neural networks (GNNs) have been widely used for document classification. However, most existing methods are based on static word co-occurrence graphs without sentence-level information, which poses three challenges:(1) word ambiguity, (2) word synonymity, and (3) dynamic contextual dependency. To address these challenges, we propose a novel GNN-based sparse structure learning model for inductive document classification. Specifically, a document-level graph is initially generated by a disjoint union of sentence-level word co-occurrence graphs. Our model collects a set of trainable edges connecting disjoint words between sentences and employs structure learning to sparsely select edges with dynamic contextual dependencies. Graphs with sparse structures can jointly exploit local and global contextual information in documents through GNNs. For inductive learning, the refined document graph is further fed into a general readout function for graph-level classification and optimization in an end-to-end manner. Extensive experiments on several real-world datasets demonstrate that the proposed model outperforms most state-of-the-art results, and reveal the necessity to learn sparse structures for each document.

樣本 · 類別 · 損失 · Performer · SimPLe ·

2019 年 1 月 16 日

Class-Balanced Loss Based on Effective Number of Samples

Yin Cui,Menglin Jia,Tsung-Yi Lin,Yang Song,Serge Belongie

from arxiv, Code is available at: //github.com/richardaecn/class-balanced-loss

With the rapid increase of large-scale, real-world datasets, it becomes critical to address the problem of long-tailed data distribution (i.e., a few classes account for most of the data, while most classes are under-represented). Existing solutions typically adopt class re-balancing strategies such as re-sampling and re-weighting based on the number of observations for each class. In this work, we argue that as the number of samples increases, the additional benefit of a newly added data point will diminish. We introduce a novel theoretical framework to measure data overlap by associating with each sample a small neighboring region rather than a single point. The effective number of samples is defined as the volume of samples and can be calculated by a simple formula $(1-\beta^{n})/(1-\beta)$, where $n$ is the number of samples and $\beta \in [0,1)$ is a hyperparameter. We design a re-weighting scheme that uses the effective number of samples for each class to re-balance the loss, thereby yielding a class-balanced loss. Comprehensive experiments are conducted on artificially induced long-tailed CIFAR datasets and large-scale datasets including ImageNet and iNaturalist. Our results show that when trained with the proposed class-balanced loss, the network is able to achieve significant performance gains on long-tailed datasets.

深度強化學習 · 強化學習 · 學成 · 回合 · 優化器 ·

2018 年 6 月 27 日

A Multi-Objective Deep Reinforcement Learning Framework

Thanh Thi Nguyen

from arxiv, 17 pages

This paper presents a new multi-objective deep reinforcement learning (MODRL) framework based on deep Q-networks. We propose the use of linear and non-linear methods to develop the MODRL framework that includes both single-policy and multi-policy strategies. The experimental results on two benchmark problems including the two-objective deep sea treasure environment and the three-objective mountain car problem indicate that the proposed framework is able to converge to the optimal Pareto solutions effectively. The proposed framework is generic, which allows implementation of different deep reinforcement learning algorithms in different complex environments. This therefore overcomes many difficulties involved with standard multi-objective reinforcement learning (MORL) methods existing in the current literature. The framework creates a platform as a testbed environment to develop methods for solving various problems associated with the current MORL. Details of the framework implementation can be referred to //www.deakin.edu.au/~thanhthi/drl.htm.

Networking · 數據集 · 遷移學習 · 學成 · 可約的 ·

2018 年 5 月 10 日

Deep Representation Learning for Domain Adaptation of Semantic Image Segmentation

Assia Benbihi,Matthieu Geist,Cédric Pradalier

Deep Convolutional Neural Networks have pushed the state-of-the art for semantic segmentation provided that a large amount of images together with pixel-wise annotations is available. Data collection is expensive and a solution to alleviate it is to use transfer learning. This reduces the amount of annotated data required for the network training but it does not get rid of this heavy processing step. We propose a method of transfer learning without annotations on the target task for datasets with redundant content and distinct pixel distributions. Our method takes advantage of the approximate content alignment of the images between two datasets when the approximation error prevents the reuse of annotation from one dataset to another. Given the annotations for only one dataset, we train a first network in a supervised manner. This network autonomously learns to generate deep data representations relevant to the semantic segmentation. Then the images in the new dataset, we train a new network to generate a deep data representation that matches the one from the first network on the previous dataset. The training consists in a regression between feature maps and does not require any annotations on the new dataset. We show that this method reaches performances similar to a classic transfer learning on the PASCAL VOC dataset with synthetic transformations.