This paper concerns diffraction-tomographic reconstruction of an object characterized by its scattering potential. We establish a rigorous generalization of the Fourier diffraction theorem in arbitrary dimension, giving a precise relation in the Fourier domain between measurements of the scattered wave and reconstructions of the scattering potential. With this theorem at hand, Fourier coverages for different experimental setups are investigated, taking into account parameters such as object orientation, direction of incidence, and frequency of illumination. Allowing for simultaneous and discontinuous variation of these parameters, we derive a general filtered backpropagation formula that yields an explicit approximation of the scattering potential for a large class of experimental setups.
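For orientation, the classical two-dimensional instance of the theorem under the first Born approximation can be stated as follows (a sketch in our own notation; normalization conventions vary across references). For an incident plane wave $e^{\mathrm{i}k x_2}$ with wavenumber $k>0$ and measurements of the scattered field $u_{\mathrm{sc}}$ on the line $x_2 = r_M$, one has, for $|\kappa| < k$,
\[
\widehat{u_{\mathrm{sc}}}(\kappa, r_M) = \frac{\mathrm{i}\,e^{\mathrm{i}\sqrt{k^2-\kappa^2}\,r_M}}{2\sqrt{k^2-\kappa^2}}\, \widehat{f}\Big(\kappa,\ \sqrt{k^2-\kappa^2}-k\Big),
\]
where $\widehat{u_{\mathrm{sc}}}$ denotes the one-dimensional Fourier transform along the measurement line and $\widehat{f}$ the Fourier transform of the scattering potential $f$. As $\kappa$ ranges over $(-k,k)$, the argument of $\widehat{f}$ traces a semicircle of radius $k$ centered at $(0,-k)$: the Fourier coverage of a single plane-wave experiment, which varying the experimental parameters serves to enlarge.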
Scene text spotting has attracted considerable research interest in recent years. Most existing scene text spotters follow the detection-then-recognition paradigm, in which the vanilla detection module struggles to determine the reading order, leading to recognition failures. Rethinking auto-regressive scene text recognition, we find that a well-trained recognizer can implicitly perceive the local semantics of all characters in a complete word or sentence without a character-level detection module. Local semantic knowledge includes not only the text content but also the spatial information of characters in the correct reading order. Motivated by this analysis, we propose the Local Semantics Guided scene text Spotter (LSGSpotter), which auto-regressively decodes the positions and content of characters guided by local semantics. Specifically, two effective modules are proposed in LSGSpotter. On the one hand, we design a Start Point Localization Module (SPLM) that locates text start points to determine the correct reading order. On the other hand, a Multi-scale Adaptive Attention Module (MAAM) adaptively aggregates text features within a local area. As a result, LSGSpotter performs arbitrary-reading-order spotting without the limitations of sophisticated detection, while reducing computational cost through a grid sampling strategy. Extensive experimental results show that LSGSpotter achieves state-of-the-art performance on the InverseText benchmark. Moreover, our spotter demonstrates superior performance on English benchmarks for arbitrary-shaped text, achieving improvements of 0.7\% and 2.5\% on Total-Text and SCUT-CTW1500, respectively. These results validate that our text spotter is effective for scene text in arbitrary reading orders and shapes.
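To make the decoding scheme concrete, here is a minimal, hypothetical PyTorch sketch of an auto-regressive spotting decoder (module names, the coordinate quantization, and all dimensions are illustrative assumptions on our part, not the paper's exact design):
\begin{verbatim}
import torch
import torch.nn as nn

class AutoRegressiveSpotter(nn.Module):
    # Hypothetical sketch: decode one character per step as a
    # (char, x-bin, y-bin) triple, so the reading order is fixed by the
    # decoding order itself, with no character-level detection module.
    def __init__(self, vocab_size, d_model=256, n_bins=100):
        super().__init__()
        self.char_emb = nn.Embedding(vocab_size, d_model)
        # one table for quantized x (ids 0..n_bins-1) and y (ids n_bins..2n_bins-1)
        self.pos_emb = nn.Embedding(2 * n_bins, d_model)
        layer = nn.TransformerDecoderLayer(d_model, nhead=8, batch_first=True)
        self.decoder = nn.TransformerDecoder(layer, num_layers=4)
        self.char_head = nn.Linear(d_model, vocab_size)   # next character
        self.xy_head = nn.Linear(d_model, 2 * n_bins)     # next position bins

    def forward(self, img_feats, prev_chars, prev_xy):
        # img_feats: (B, HW, d_model) visual memory from the backbone
        # prev_chars: (B, T); prev_xy: (B, T, 2) already-decoded prefix
        tok = self.char_emb(prev_chars) + self.pos_emb(prev_xy).sum(dim=2)
        T = tok.size(1)   # causal mask enforces left-to-right decoding
        mask = torch.triu(torch.full((T, T), float('-inf'),
                                     device=tok.device), diagonal=1)
        h = self.decoder(tok, img_feats, tgt_mask=mask)
        return self.char_head(h), self.xy_head(h)
\end{verbatim}
In such a setup, the start point predicted by an SPLM-like module would seed the first decoding step, and the decoder then commits to a reading order one character at a time.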
This paper introduces the distributed and intelligent integrated sensing and communications (DISAC) concept, a transformative approach for 6G wireless networks that extends the emerging concept of integrated sensing and communications (ISAC). To overcome the limitations of existing ISAC models, DISAC introduces two novel foundational functionalities for both sensing and communications: a distributed architecture, which enables large-scale and energy-efficient tracking of connected users and objects by leveraging the fusion of heterogeneous sensors, and a semantic and goal-oriented framework, which enables the transition from classical data fusion to the composition of semantically selected information.
This paper studies the estimation of large precision matrices and Cholesky factors obtained by observing a Gaussian process at many locations. Under general assumptions on the precision and the observations, we show that the sample complexity scales poly-logarithmically with the size of the precision matrix and its Cholesky factor. The key challenge in these estimation tasks is the polynomial growth of the condition number of the target matrices with their size. For precision estimation, our theory hinges on an intuitive local regression technique on the lattice graph which exploits the approximate sparsity implied by the screening effect. For Cholesky factor estimation, we leverage a block-Cholesky decomposition recently used to establish complexity bounds for sparse Cholesky factorization.
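As an illustration of the local-regression idea, here is a toy numpy sketch under our own simplifying assumptions (lexicographic ordering, a fixed conditioning radius, and the modified-Cholesky identity $\Theta = (I-B)^\top D^{-1}(I-B)$ for Gaussian vectors); it is not the paper's estimator, only the screening-effect intuition in code:
\begin{verbatim}
import numpy as np

def local_precision(X, grid, radius):
    # X: (n, p) samples of a Gaussian field; grid: (p, 2) lattice coords.
    # Regress each variable only on nearby, previously ordered variables,
    # exploiting the approximate sparsity implied by the screening effect.
    n, p = X.shape
    B = np.zeros((p, p))
    d = np.empty(p)
    d[0] = X[:, 0].var()
    for i in range(1, p):
        dist = np.linalg.norm(grid[:i] - grid[i], axis=1)
        nbr = np.where(dist <= radius)[0]        # local conditioning set
        if nbr.size:
            beta, *_ = np.linalg.lstsq(X[:, nbr], X[:, i], rcond=None)
            B[i, nbr] = beta
            resid = X[:, i] - X[:, nbr] @ beta
        else:
            resid = X[:, i]
        d[i] = resid.var()                       # conditional variance
    I_B = np.eye(p) - B
    return I_B.T @ (I_B / d[:, None])            # (I-B)^T D^{-1} (I-B)
\end{verbatim}
The number of regression coefficients per row is governed by the radius rather than by $p$, which is what makes a poly-logarithmic sample complexity plausible.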
Recent contrastive representation learning methods rely on estimating mutual information (MI) between multiple views of an underlying context. For example, we can derive multiple views of a given image by applying data augmentation, or we can split a sequence into views comprising the past and future of some step in the sequence. Contrastive lower bounds on MI are easy to optimize, but have a strong underestimation bias when the true MI is large. We propose decomposing the full MI estimation problem into a sum of smaller estimation problems by splitting one of the views into progressively more informed subviews and applying the chain rule for MI to the decomposed views. The resulting expression is a sum of unconditional and conditional MI terms, each measuring a modest chunk of the total MI, which facilitates approximation via contrastive bounds. To maximize the sum, we formulate a contrastive lower bound on conditional MI that can be approximated efficiently. We refer to our general approach as Decomposed Estimation of Mutual Information (DEMI). We show that DEMI can capture a larger amount of MI than standard non-decomposed contrastive bounds in a synthetic setting, and that it learns better representations in a vision domain and for dialogue generation.
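Schematically, with notation of our own, splitting the view $y$ into progressively more informed subviews $y^{(1)}, \dots, y^{(m)}$ and applying the chain rule gives
\[
I(x; y) = I\big(x; y^{(1)}\big) + \sum_{i=2}^{m} I\big(x;\, y^{(i)} \mid y^{(1)}, \dots, y^{(i-1)}\big).
\]
Since a contrastive (InfoNCE-style) bound estimated with $N$ negatives cannot exceed $\log N$, bounding each of the $m$ modest terms separately and summing can certify up to roughly $m \log N$ nats in total, whereas a single non-decomposed bound saturates at $\log N$; this is the source of the underestimation bias noted above.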
In this paper, we propose a novel Feature Decomposition and Reconstruction Learning (FDRL) method for effective facial expression recognition. We view expression information as the combination of shared information (expression similarities) across different expressions and unique information (expression-specific variations) for each expression. More specifically, FDRL consists of two crucial networks: a Feature Decomposition Network (FDN) and a Feature Reconstruction Network (FRN). FDN first decomposes the basic features extracted by a backbone network into a set of facial action-aware latent features to model expression similarities. FRN then captures the intra-feature and inter-feature relationships among the latent features to characterize expression-specific variations, and reconstructs the expression feature. To this end, two modules, an intra-feature relation modeling module and an inter-feature relation modeling module, are developed in FRN. Experimental results on both in-the-lab databases (CK+, MMI, and Oulu-CASIA) and in-the-wild databases (RAF-DB and SFEW) show that the proposed FDRL method consistently achieves higher recognition accuracy than several state-of-the-art methods. This clearly highlights the benefit of feature decomposition and reconstruction for classifying expressions.
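A minimal PyTorch sketch of the decompose-then-reconstruct pipeline (module names, relation-modeling choices, and sizes are our assumptions, not the paper's exact design):
\begin{verbatim}
import torch
import torch.nn as nn

class FDRLSketch(nn.Module):
    # Hypothetical sketch: FDN projects the backbone feature onto a set
    # of facial-action-aware latent features (shared information); FRN
    # models intra-/inter-feature relations (expression-specific
    # variations) and reconstructs a single expression feature.
    def __init__(self, in_dim=512, n_latent=8, d=64, n_classes=7):
        super().__init__()
        self.decompose = nn.ModuleList(
            [nn.Linear(in_dim, d) for _ in range(n_latent)])   # FDN
        self.intra = nn.Sequential(nn.Linear(d, 1), nn.Sigmoid())
        self.inter = nn.MultiheadAttention(d, num_heads=4, batch_first=True)
        self.classifier = nn.Linear(d, n_classes)

    def forward(self, feat):                       # feat: (B, in_dim)
        latents = torch.stack([f(feat) for f in self.decompose], dim=1)
        w = self.intra(latents)                    # (B, K, 1) importance
        rel, _ = self.inter(latents, latents, latents)  # inter relations
        expr = (w * rel).sum(dim=1)                # reconstructed feature
        return self.classifier(expr)
\end{verbatim}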
Translational distance-based knowledge graph embedding has shown progressive improvements on the link prediction task, from TransE to the latest state-of-the-art RotatE. However, N-to-1, 1-to-N, and N-to-N predictions remain challenging. In this work, we propose a novel translational distance-based approach for knowledge graph link prediction. The proposed method is two-fold: first, we extend RotatE from the 2D complex domain to a high-dimensional space with orthogonal transforms, which increases the capacity for modeling relations. Second, the graph context is explicitly modeled via two directed context representations, which are used as part of the distance scoring function to measure the plausibility of triples during training and inference. The proposed approach effectively improves prediction accuracy on the difficult N-to-1, 1-to-N, and N-to-N cases. Experimental results show that it outperforms the RotatE baseline on two benchmark datasets, especially on FB15k-237, which contains many nodes of high in-degree.
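One way to realize a high-dimensional orthogonal relation transform is sketched below (the parametrization via the matrix exponential of a skew-symmetric matrix is one of several possibilities and is our assumption, not necessarily the paper's):
\begin{verbatim}
import torch
import torch.nn as nn

class OrthogonalKGE(nn.Module):
    # Hypothetical sketch: replace RotatE's 2-D complex rotations with a
    # learned orthogonal transform Q_r per relation; exp(skew) is always
    # orthogonal, so distances are preserved as in RotatE.
    def __init__(self, n_ent, n_rel, dim=100):
        super().__init__()
        self.ent = nn.Embedding(n_ent, dim)
        self.rel_skew = nn.Parameter(torch.randn(n_rel, dim, dim) * 0.01)

    def score(self, h, r, t):           # h, r, t: (B,) index tensors
        A = self.rel_skew[r]
        Q = torch.linalg.matrix_exp(A - A.transpose(-1, -2))  # Q^T Q = I
        rotated = torch.einsum('bij,bj->bi', Q, self.ent(h))
        return -torch.norm(rotated - self.ent(t), dim=-1)
\end{verbatim}
In practice a block-diagonal $Q_r$ (many small rotations) would keep the parameter count and cost closer to RotatE's; the dense version above is only for clarity.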
Knowledge graph completion aims to predict missing relations between entities in a knowledge graph. While many different methods have been proposed, there is a lack of a unifying framework that leads to state-of-the-art results. Here we develop PathCon, a knowledge graph completion method that harnesses four novel insights to outperform existing methods. PathCon predicts relations between a pair of entities by (1) considering the Relational Context of each entity, capturing the relation types adjacent to the entity via a novel edge-based message passing scheme; (2) considering the Relational Paths, capturing all paths between the two entities; and (3) adaptively integrating the Relational Context and Relational Paths through a learnable attention mechanism. Importantly, (4) in contrast to conventional node-based representations, PathCon represents context and paths using only relation types, which makes it applicable in the inductive setting. Experimental results on knowledge graph benchmarks, as well as on our newly proposed dataset, show that PathCon outperforms state-of-the-art knowledge graph completion methods by a large margin. Finally, PathCon provides interpretable explanations by identifying the context relations and paths that are important for a given predicted relation.
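A minimal sketch of edge-based message passing over relation types only (our own simplification of the Relational Context idea; note that entity identities never enter, which is what makes the scheme inductive):
\begin{verbatim}
import torch
import torch.nn as nn

def relational_context(edge_index, edge_type, rel_emb, n_nodes, n_layers=2):
    # edge_index: (2, E) endpoints; edge_type: (E,) relation type per edge;
    # rel_emb: nn.Embedding over relation types. States live on edges and
    # are built purely from relation-type embeddings.
    d = rel_emb.embedding_dim
    msg = rel_emb(edge_type)                 # (E, d) initial edge states
    src, dst = edge_index
    for _ in range(n_layers):
        node = torch.zeros(n_nodes, d)
        node.index_add_(0, src, msg)         # aggregate edge states
        node.index_add_(0, dst, msg)         # at both endpoints
        msg = msg + node[src] + node[dst]    # edge update from its two ends
    return node                              # (n_nodes, d) context vectors

# usage: ctx = relational_context(ei, et, nn.Embedding(n_rel, 64), n_nodes)
\end{verbatim}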
Intent classification and slot filling are two essential tasks for natural language understanding. They often suffer from small-scale human-labeled training data, resulting in poor generalization, especially for rare words. Recently, the language representation model BERT (Bidirectional Encoder Representations from Transformers) has facilitated pre-training deep bidirectional representations on large-scale unlabeled corpora and, after simple fine-tuning, has produced state-of-the-art models for a wide variety of natural language processing tasks. However, little effort has been made to explore BERT for natural language understanding. In this work, we propose a joint intent classification and slot filling model based on BERT. Experimental results demonstrate that our proposed model achieves significant improvements in intent classification accuracy, slot filling F1, and sentence-level semantic frame accuracy on several public benchmark datasets, compared to attention-based recurrent neural network models and slot-gated models.
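A common realization of such a joint model, sketched with the HuggingFace transformers library (a minimal illustration that may differ from the authors' implementation):
\begin{verbatim}
import torch.nn as nn
from transformers import BertModel

class JointIntentSlot(nn.Module):
    # Sketch: the pooled [CLS] representation feeds a sentence-level
    # intent classifier, while per-token representations feed a slot
    # tagger; both heads are fine-tuned jointly on top of BERT.
    def __init__(self, n_intents, n_slots):
        super().__init__()
        self.bert = BertModel.from_pretrained("bert-base-uncased")
        hidden = self.bert.config.hidden_size
        self.intent_head = nn.Linear(hidden, n_intents)
        self.slot_head = nn.Linear(hidden, n_slots)

    def forward(self, input_ids, attention_mask):
        out = self.bert(input_ids=input_ids, attention_mask=attention_mask)
        intent_logits = self.intent_head(out.pooler_output)   # sentence level
        slot_logits = self.slot_head(out.last_hidden_state)   # token level
        return intent_logits, slot_logits

# training would minimize CE(intent) + CE(slots) jointly
\end{verbatim}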
High spectral dimensionality and the shortage of annotations make hyperspectral image (HSI) classification a challenging problem. Recent studies suggest that convolutional neural networks can learn discriminative spatial features, which play a paramount role in HSI interpretation. However, most of these methods ignore the distinctive spectral-spatial characteristics of hyperspectral data. In addition, a large amount of unlabeled data remains an untapped resource. Therefore, we propose an integration of generative adversarial networks (GANs) and probabilistic graphical models for HSI classification. Specifically, we use a spectral-spatial generator and a discriminator to identify the land cover categories of hyperspectral cubes. Moreover, to take advantage of the large amount of unlabeled data, we adopt a conditional random field to refine the preliminary classification results generated by the GAN. Experimental results obtained on two commonly studied datasets demonstrate that the proposed framework achieves encouraging classification accuracy with a small amount of training data.
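For concreteness, a hypothetical spectral-spatial discriminator in the semi-supervised GAN style (architecture and sizes are our assumptions; 3-D convolutions are one common way to couple spectral and spatial structure, and the CRF refinement would be a separate post-processing step on the soft labels):
\begin{verbatim}
import torch.nn as nn

class SpectralSpatialDiscriminator(nn.Module):
    # Sketch: 3-D convolutions treat the spectral axis as a third
    # dimension, so the network sees spectral and spatial structure
    # jointly; the final layer has K+1 outputs (K land-cover classes
    # plus one "fake" class), a standard semi-supervised GAN setup.
    def __init__(self, n_classes):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv3d(1, 16, kernel_size=(7, 3, 3), padding=(3, 1, 1)),
            nn.LeakyReLU(0.2),
            nn.Conv3d(16, 32, kernel_size=(7, 3, 3), padding=(3, 1, 1)),
            nn.LeakyReLU(0.2),
            nn.AdaptiveAvgPool3d(1),
            nn.Flatten(),
            nn.Linear(32, n_classes + 1),   # K real classes + 1 fake
        )

    def forward(self, cube):                # cube: (B, 1, bands, h, w)
        return self.net(cube)
\end{verbatim}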
This paper learns a graphical model, namely an explanatory graph, which reveals the knowledge hierarchy hidden inside a pre-trained CNN. Considering that each filter in a conv-layer of a pre-trained CNN usually represents a mixture of object parts, we propose a simple yet efficient method to automatically disentangle the different part patterns from each filter and construct an explanatory graph. In the explanatory graph, each node represents a part pattern, and each edge encodes co-activation and spatial relationships between patterns. Importantly, we learn the explanatory graph for a pre-trained CNN in an unsupervised manner, i.e., without any annotation of object parts. Experiments show that each graph node consistently represents the same object part across different images. We transfer the part patterns in the explanatory graph to the task of part localization, and our method significantly outperforms other approaches.
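A toy sketch of the mining step (our own simplification: peak activation positions per filter are clustered into candidate part patterns, and co-activation frequencies define edges; the paper's actual learning procedure is more involved):
\begin{verbatim}
import numpy as np
from sklearn.cluster import KMeans

def mine_part_patterns(act_maps, k_patterns=4, co_thresh=0.5):
    # act_maps: (n_imgs, n_filters, H, W) conv-layer activations.
    n_imgs, n_filters, H, W = act_maps.shape
    flat = act_maps.reshape(n_imgs, n_filters, -1)
    strength = flat.max(-1)                 # peak value per image/filter
    idx = flat.argmax(-1)                   # peak location per image/filter
    peaks = np.stack([idx // W, idx % W], -1).astype(float)
    fired = strength > strength.mean()      # filter "fires" on this image
    nodes, assign = [], np.empty((n_imgs, n_filters), dtype=int)
    for f in range(n_filters):
        km = KMeans(n_clusters=k_patterns, n_init=10).fit(peaks[:, f])
        assign[:, f] = km.labels_ + f * k_patterns   # global pattern ids
        nodes.extend(km.cluster_centers_)
    n_nodes = n_filters * k_patterns
    co = np.zeros((n_nodes, n_nodes))       # co-activation counts
    for i in range(n_imgs):
        active = np.unique(assign[i][fired[i]])
        co[np.ix_(active, active)] += 1
    edges = np.argwhere((co / n_imgs > co_thresh)
                        & ~np.eye(n_nodes, dtype=bool))
    return np.array(nodes), edges           # graph nodes and edges
\end{verbatim}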