亚洲男人的天堂2018av,欧美草比,久久久久久免费视频精选,国色天香在线看免费,久久久久亚洲av成人片仓井空

<tr id='5vl83'><strong id='5vl83'></strong><small id='5vl83'></small><button id='5vl83'></button><li id='5vl83'><noscript id='5vl83'><big id='5vl83'></big><dt id='5vl83'></dt></noscript></li></tr><ol id='5vl83'><option id='5vl83'><table id='5vl83'><blockquote id='5vl83'><tbody id='5vl83'></tbody></blockquote></table></option></ol><u id='5vl83'></u><kbd id='5vl83'><kbd id='5vl83'></kbd></kbd>

<code id='5vl83'><strong id='5vl83'></strong></code>

<fieldset id='5vl83'></fieldset>

<span id='5vl83'></span>

<ins id='5vl83'></ins>

<acronym id='5vl83'><em id='5vl83'></em><td id='5vl83'><div id='5vl83'></div></td></acronym><address id='5vl83'><big id='5vl83'><big id='5vl83'></big><legend id='5vl83'></legend></big></address>

<i id='5vl83'><div id='5vl83'><ins id='5vl83'></ins></div></i>

<i id='5vl83'></i>

·

損失函數（機器學習） · 泛函 · 損失 · Networking · Neural Networks ·

2023 年 6 月 24 日

Current density impedance imaging with PINNs

Chenguang Duan,Yuling Jiao,Xiliang Lu,Jerry Zhijian Yang

In this paper, we introduce CDII-PINNs, a computationally efficient method for solving CDII using PINNs in the framework of Tikhonov regularization. This method constructs a physics-informed loss function by merging the regularized least-squares output functional with an underlying differential equation, which describes the relationship between the conductivity and voltage. A pair of neural networks representing the conductivity and voltage, respectively, are coupled by this loss function. Then, minimizing the loss function provides a reconstruction. A rigorous theoretical guarantee is provided. We give an error analysis for CDII-PINNs and establish a convergence rate, based on prior selected neural network parameters in terms of the number of samples. The numerical simulations demonstrate that CDII-PINNs are efficient, accurate and robust to noise levels ranging from $1\%$ to $20\%$.

相關內容

損失函數（機器學習）

損失函(han)數（機器學(xue)習）

損失(shi)函(han)(han)數(shu)(shu)(shu)，在AI中亦稱呼距(ju)離函(han)(han)數(shu)(shu)(shu)，度量(liang)函(han)(han)數(shu)(shu)(shu)。此處的距(ju)離代表的是(shi)抽象性的，代表真實數(shu)(shu)(shu)據與預(yu)測數(shu)(shu)(shu)據之間(jian)的誤(wu)差。損失(shi)函(han)(han)數(shu)(shu)(shu)（loss function）是(shi)用(yong)來估量(liang)你(ni)模型的預(yu)測值(zhi)f(x)與真實值(zhi)Y的不一(yi)致程度，它(ta)是(shi)一(yi)個非負實值(zhi)函(han)(han)數(shu)(shu)(shu),通常(chang)使用(yong)L(Y, f(x))來表示，損失(shi)函(han)(han)數(shu)(shu)(shu)越(yue)(yue)小(xiao)，模型的魯(lu)棒(bang)性就越(yue)(yue)好。損失(shi)函(han)(han)數(shu)(shu)(shu)是(shi)經(jing)驗(yan)風險函(han)(han)數(shu)(shu)(shu)的核心部分(fen)，也是(shi)結構風險函(han)(han)數(shu)(shu)(shu)重要組成部分(fen)。

結構化學習 · Learning · DAG · MCMC · 近似 ·

2023 年 8 月 16 日

Order-based Structure Learning without Score Equivalence

Hyunwoong Chang,James Cai,Quan Zhou

We propose an empirical Bayes formulation of the structure learning problem, where the prior specification assumes that all node variables have the same error variance, an assumption known to ensure the identifiability of the underlying causal directed acyclic graph (DAG). To facilitate efficient posterior computation, we approximate the posterior probability of each ordering by that of a best DAG model, which naturally leads to an order-based Markov chain Monte Carlo (MCMC) algorithm. Strong selection consistency for our model in high-dimensional settings is proved under a condition that allows heterogeneous error variances, and the mixing behavior of our sampler is theoretically investigated. Further, we propose a new iterative top-down algorithm, which quickly yields an approximate solution to the structure learning problem and can be used to initialize the MCMC sampler. We demonstrate that our method outperforms other state-of-the-art algorithms under various simulation settings, and conclude the paper with a single-cell real-data study illustrating practical advantages of the proposed method.

Learning · 可約的 · Performer · 語言模型化 · Boosting（一種模型訓練加速方式） ·

2023 年 8 月 16 日

Pre-training with Large Language Model-based Document Expansion for Dense Passage Retrieval

Guangyuan Ma,Xing Wu,Peng Wang,Zijia Lin,Songlin Hu

from arxiv, 10 pages, 3 tables, 4 figures, under review

In this paper, we systematically study the potential of pre-training with Large Language Model(LLM)-based document expansion for dense passage retrieval. Concretely, we leverage the capabilities of LLMs for document expansion, i.e. query generation, and effectively transfer expanded knowledge to retrievers using pre-training strategies tailored for passage retrieval. These strategies include contrastive learning and bottlenecked query generation. Furthermore, we incorporate a curriculum learning strategy to reduce the reliance on LLM inferences. Experimental results demonstrate that pre-training with LLM-based document expansion significantly boosts the retrieval performance on large-scale web-search tasks. Our work shows strong zero-shot and out-of-domain retrieval abilities, making it more widely applicable for retrieval when initializing with no human-labeled data.

正則化項 · 協變量偏移 · INFORMS · 估計/估計量 · 模型評估 ·

2023 年 8 月 15 日

On regularized Radon-Nikodym differentiation

Duc Hoan Nguyen,Werner Zellinger,Sergei V. Pereverzyev

from arxiv, arXiv admin note: text overlap with arXiv:2307.11503

We discuss the problem of estimating Radon-Nikodym derivatives. This problem appears in various applications, such as covariate shift adaptation, likelihood-ratio testing, mutual information estimation, and conditional probability estimation. To address the above problem, we employ the general regularization scheme in reproducing kernel Hilbert spaces. The convergence rate of the corresponding regularized algorithm is established by taking into account both the smoothness of the derivative and the capacity of the space in which it is estimated. This is done in terms of general source conditions and the regularized Christoffel functions. We also find that the reconstruction of Radon-Nikodym derivatives at any particular point can be done with high order of accuracy. Our theoretical results are illustrated by numerical simulations.

結點 · Learning · MoDELS · 層 · 集成 ·

2023 年 8 月 15 日

Self-supervised Hypergraphs for Learning Multiple World Interpretations

Alina Marcu,Mihai Pirvu,Dragos Costea,Emanuela Haller,Emil Slusanschi,Ahmed Nabil Belbachir,Rahul Sukthankar,Marius Leordeanu

We present a method for learning multiple scene representations given a small labeled set, by exploiting the relationships between such representations in the form of a multi-task hypergraph. We also show how we can use the hypergraph to improve a powerful pretrained VisTransformer model without any additional labeled data. In our hypergraph, each node is an interpretation layer (e.g., depth or segmentation) of the scene. Within each hyperedge, one or several input nodes predict the layer at the output node. Thus, each node could be an input node in some hyperedges and an output node in others. In this way, multiple paths can reach the same node, to form ensembles from which we obtain robust pseudolabels, which allow self-supervised learning in the hypergraph. We test different ensemble models and different types of hyperedges and show superior performance to other multi-task graph models in the field. We also introduce Dronescapes, a large video dataset captured with UAVs in different complex real-world scenes, with multiple representations, suitable for multi-task learning.

估計/估計量 · Markov · 馬爾可夫鏈 · 可逆馬爾可夫鏈 · 方差 ·

2023 年 8 月 15 日

Efficient shape-constrained inference for the autocovariance sequence from a reversible Markov chain

Stephen Berg,Hyebin Song

In this paper, we study the problem of estimating the autocovariance sequence resulting from a reversible Markov chain. A motivating application for studying this problem is the estimation of the asymptotic variance in central limit theorems for Markov chains. We propose a novel shape-constrained estimator of the autocovariance sequence, which is based on the key observation that the representability of the autocovariance sequence as a moment sequence imposes certain shape constraints. We examine the theoretical properties of the proposed estimator and provide strong consistency guarantees for our estimator. In particular, for geometrically ergodic reversible Markov chains, we show that our estimator is strongly consistent for the true autocovariance sequence with respect to an $\ell_2$ distance, and that our estimator leads to strongly consistent estimates of the asymptotic variance. Finally, we perform empirical studies to illustrate the theoretical properties of the proposed estimator as well as to demonstrate the effectiveness of our estimator in comparison with other current state-of-the-art methods for Markov chain Monte Carlo variance estimation, including batch means, spectral variance estimators, and the initial convex sequence estimator.

穩健性 · 優化器 · 主動學習 · 有偏 · 樣例 ·

2023 年 8 月 14 日

Robust expected improvement for Bayesian optimization

Ryan B. Christianson,Robert B. Gramacy

from arxiv, 27 pages, 17 figures, 1 table

Bayesian Optimization (BO) links Gaussian Process (GP) surrogates with sequential design toward optimizing expensive-to-evaluate black-box functions. Example design heuristics, or so-called acquisition functions, like expected improvement (EI), balance exploration and exploitation to furnish global solutions under stringent evaluation budgets. However, they fall short when solving for robust optima, meaning a preference for solutions in a wider domain of attraction. Robust solutions are useful when inputs are imprecisely specified, or where a series of solutions is desired. A common mathematical programming technique in such settings involves an adversarial objective, biasing a local solver away from ``sharp'' troughs. Here we propose a surrogate modeling and active learning technique called robust expected improvement (REI) that ports adversarial methodology into the BO/GP framework. After describing the methods, we illustrate and draw comparisons to several competitors on benchmark synthetic exercises and real problems of varying complexity.

contrastive · 變換 · 學成 · 判別器 · Performer ·

2020 年 12 月 9 日

Contrastive Transformation for Self-supervised Correspondence Learning

Ning Wang,Wengang Zhou,Houqiang Li

from arxiv, To appear in AAAI 2021

In this paper, we focus on the self-supervised learning of visual correspondence using unlabeled videos in the wild. Our method simultaneously considers intra- and inter-video representation associations for reliable correspondence estimation. The intra-video learning transforms the image contents across frames within a single video via the frame pair-wise affinity. To obtain the discriminative representation for instance-level separation, we go beyond the intra-video analysis and construct the inter-video affinity to facilitate the contrastive transformation across different videos. By forcing the transformation consistency between intra- and inter-video levels, the fine-grained correspondence associations are well preserved and the instance-level feature discrimination is effectively reinforced. Our simple framework outperforms the recent self-supervised correspondence methods on a range of visual tasks including video object tracking (VOT), video object segmentation (VOS), pose keypoint tracking, etc. It is worth mentioning that our method also surpasses the fully-supervised affinity representation (e.g., ResNet) and performs competitively against the recent fully-supervised algorithms designed for the specific tasks (e.g., VOT and VOS).

圖 · 知識圖譜 · 鏈路預測 · INFORMS · binary ·

2020 年 1 月 2 日

Reasoning on Knowledge Graphs with Debate Dynamics

Marcel Hildebrandt,Jorge Andres Quintero Serna,Yunpu Ma,Martin Ringsquandl,Mitchell Joblin,Volker Tresp

from arxiv, AAAI-2020

We propose a novel method for automatic reasoning on knowledge graphs based on debate dynamics. The main idea is to frame the task of triple classification as a debate game between two reinforcement learning agents which extract arguments -- paths in the knowledge graph -- with the goal to promote the fact being true (thesis) or the fact being false (antithesis), respectively. Based on these arguments, a binary classifier, called the judge, decides whether the fact is true or false. The two agents can be considered as sparse, adversarial feature generators that present interpretable evidence for either the thesis or the antithesis. In contrast to other black-box methods, the arguments allow users to get an understanding of the decision of the judge. Since the focus of this work is to create an explainable method that maintains a competitive predictive accuracy, we benchmark our method on the triple classification and link prediction task. Thereby, we find that our method outperforms several baselines on the benchmark datasets FB15k-237, WN18RR, and Hetionet. We also conduct a survey and find that the extracted arguments are informative for users.

BERT · Performer · Transformer模型 · SimPLe · HTTPS ·

2019 年 3 月 25 日

Fine-tune BERT for Extractive Summarization

BERT, a pre-trained Transformer model, has achieved ground-breaking performance on multiple NLP tasks. In this paper, we describe BERTSUM, a simple variant of BERT, for extractive summarization. Our system is the state of the art on the CNN/Dailymail dataset, outperforming the previous best-performed system by 1.65 on ROUGE-L. The codes to reproduce our results are available at //github.com/nlpyang/BertSum

Softmax · 邊緣化 · Performer · Better · state-of-the-art ·

2018 年 1 月 18 日

Additive Margin Softmax for Face Verification

Feng Wang,Weiyang Liu,Haijun Liu,Jian Cheng

from arxiv, technical report

In this paper, we propose a conceptually simple and geometrically interpretable objective function, i.e. additive margin Softmax (AM-Softmax), for deep face verification. In general, the face verification task can be viewed as a metric learning problem, so learning large-margin face features whose intra-class variation is small and inter-class difference is large is of great importance in order to achieve good performance. Recently, Large-margin Softmax and Angular Softmax have been proposed to incorporate the angular margin in a multiplicative manner. In this work, we introduce a novel additive angular margin for the Softmax loss, which is intuitively appealing and more interpretable than the existing works. We also emphasize and discuss the importance of feature normalization in the paper. Most importantly, our experiments on LFW BLUFR and MegaFace show that our additive margin softmax loss consistently performs better than the current state-of-the-art methods using the same network architecture and training dataset. Our code has also been made available at //github.com/happynear/AMSoftmax

閱讀: 0 點贊: 0

小貼士

登錄享

相關主題

損(sun)失函數（機器(qi)學習(xi)）

Neural Networks

北京阿比特科技有限公司

注冊地址：北京市海淀區羊坊店路18號2幢3層301-191

<dir id='1Qcb8'><del id='2MFpx'><del id='S2Wes'></del><pre id='bhEnt'><pre id='cJDuO'><option id='04Ch1'><address id='MyHwj'></address><bdo id='0l9RU'><tr id='Ur8TW'><acronym id='xAmov'><pre id='zsjiN'></pre></acronym><div id='rmSw9'></div></tr></bdo></option></pre><small id='8kxzV'><address id='uuz6s'><u id='1iXCI'><legend id='VLUHr'><option id='IQoVA'><abbr id='uoyT3'></abbr><li id='WDLSW'><pre id='K4H9C'></pre></li></option></legend><select id='dar6F'></select></u></address></small></pre></del><sup id='rERJ3'></sup><blockquote id='PCkfz'><dt id='r7M9B'></dt></blockquote><blockquote id='g6QcG'></blockquote></dir><tt id='wHeP9'></tt><u id='2dVfn'><tt id='Izsqj'><form id='ZGY2n'></form></tt><td id='usGtF'><dt id='OrJHv'></dt></td></u>

<code id='el9bs'><i id='otR3O'><q id='40dta'><legend id='l7f5X'><pre id='xLsnM'><style id='sp6b8'><acronym id='bhbKl'><i id='oywez'><form id='WAwAa'><option id='k7JYA'><center id='Kvvo4'></center></option></form></i></acronym></style><tt id='C3Uz0'></tt></pre></legend></q></i></code><center id='9xP5N'></center>

<dd id='o3w3N'></dd>

<style id='ROIzL'></style><sub id='vgmeo'><dfn id='DTwaL'><abbr id='xu8yl'><big id='bN6Bc'><bdo id='zqwnz'></bdo></big></abbr></dfn></sub>_{<dir id='fT9OF'></dir>}