
The universal approximation theorem states that a neural network with one hidden layer can approximate continuous functions on compact sets with any desired precision. This theorem supports using neural networks for various applications, including regression and classification tasks. Furthermore, it is valid for real-valued neural networks and some hypercomplex-valued neural networks such as complex-, quaternion-, tessarine-, and Clifford-valued neural networks. However, hypercomplex-valued neural networks are a type of vector-valued neural network defined on an algebra with additional algebraic or geometric properties. This paper extends the universal approximation theorem to a wide range of vector-valued neural networks, including hypercomplex-valued models as particular instances. Specifically, we introduce the concept of non-degenerate algebra and state the universal approximation theorem for neural networks defined on such algebras.
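As a rough illustration of the classical real-valued statement only (not the vector-valued or hypercomplex result of the paper), the sketch below builds a one-hidden-layer ReLU network that reproduces the piecewise-linear interpolant of a continuous function on an interval. The function names `build_relu_network`, `evaluate`, and `max_error` are hypothetical and chosen for this example; the choice of ReLU and of equally spaced knots is an assumption made for simplicity.

```python
import math


def relu(z):
    return max(0.0, z)


def build_relu_network(f, a, b, n_hidden):
    """Build a one-hidden-layer ReLU network interpolating f on [a, b].

    The network is g(x) = bias + sum_i coeffs[i] * relu(x - knots[i]),
    which equals the piecewise-linear interpolant of f at n_hidden + 1
    equally spaced points. (Illustrative construction, real-valued case.)
    """
    h = (b - a) / n_hidden
    knots = [a + i * h for i in range(n_hidden)]
    values = [f(a + i * h) for i in range(n_hidden + 1)]
    slopes = [(values[i + 1] - values[i]) / h for i in range(n_hidden)]
    # Each hidden unit contributes the change in slope at its knot.
    coeffs = [slopes[0]] + [slopes[i] - slopes[i - 1] for i in range(1, n_hidden)]
    return values[0], knots, coeffs


def evaluate(network, x):
    bias, knots, coeffs = network
    return bias + sum(c * relu(x - k) for c, k in zip(coeffs, knots))


def max_error(f, network, a, b, n_points=2001):
    """Largest absolute error on a uniform grid over [a, b]."""
    xs = [a + (b - a) * i / (n_points - 1) for i in range(n_points)]
    return max(abs(f(x) - evaluate(network, x)) for x in xs)


if __name__ == "__main__":
    for width in (10, 50, 200):
        net = build_relu_network(math.sin, 0.0, math.pi, width)
        print(width, max_error(math.sin, net, 0.0, math.pi))
```

Widening the hidden layer shrinks the error on the compact interval, which is the qualitative behaviour the theorem guarantees; the theorem itself is an existence result and does not prescribe this particular construction.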

Related content

Neural Networks


Neural Networks is the archival journal of the world's three oldest neural modeling societies: the International Neural Network Society (INNS), the European Neural Network Society (ENNS), and the Japanese Neural Network Society (JNNS). Neural Networks provides a forum for developing and nurturing an international community of scholars and practitioners who are interested in all aspects of neural networks and related approaches to computational intelligence. Neural Networks welcomes submissions of high-quality papers that contribute to the full range of neural networks research, from behavioral and brain modeling and learning algorithms, through mathematical and computational analyses, to engineering and technological applications of systems that make significant use of neural network concepts and techniques. This unique and broad scope facilitates the exchange of ideas between biological and technological studies and helps foster the development of an interdisciplinary community interested in biologically inspired computational intelligence. Accordingly, the Neural Networks editorial board represents expertise in fields including psychology, neurobiology, computer science, engineering, mathematics, and physics. The journal publishes articles, letters, and reviews, as well as letters to the editor, editorials, current events, software surveys, and patent information. Articles are published in one of five sections: Cognitive Science, Neuroscience, Learning Systems, Mathematical and Computational Analysis, and Engineering and Applications.

Networking · Optimizer · Directed · Examples · Networks

October 1, 2024

Decentralized Optimization in Time-Varying Networks with Arbitrary Delays

Tomas Ortega, Hamid Jafarkhani

from arxiv, arXiv admin note: text overlap with arXiv:2401.11344

We consider a decentralized optimization problem for networks affected by communication delays. Examples of such networks include collaborative machine learning, sensor networks, and multi-agent systems. To mimic communication delays, we add virtual non-computing nodes to the network, resulting in directed graphs. This motivates investigating decentralized optimization solutions on directed graphs. Existing solutions assume nodes know their out-degrees, resulting in limited applicability. To overcome this limitation, we introduce a novel gossip-based algorithm, called DT-GO, that does not need to know the out-degrees. The algorithm is applicable in general directed networks, for example networks with delays or limited acknowledgment capabilities. We derive convergence rates for both convex and non-convex objectives, showing that our algorithm achieves the same complexity order as centralized Stochastic Gradient Descent. In other words, the effects of the graph topology and delays are confined to higher-order terms. Additionally, we extend our analysis to accommodate time-varying network topologies. Numerical simulations are provided to support our theoretical findings.

INFORMS · Networking · Edge · Optimizer · Graph

September 27, 2024

Optimizing Information Access in Networks via Edge Augmentation

Aditya Bhaskara, Alex Crane, Shweta Jain, Md Mumtahin Habib Ullah Mazumder, Blair D. Sullivan, Prasanth Yalamanchili

from arxiv, Version 2 adds a new single-criteria approximation

Given a graph $G = (V, E)$ and a model of information flow on that network, a fundamental question is to understand whether all nodes have sufficient access to information generated at other nodes in the graph. If not, we can ask if a small set of interventions in the form of edge additions improve information access. Formally, the broadcast value of a network is defined to be the minimum over pairs $u,v \in V$ of the probability that an information cascade starting at $u$ reaches $v$. Having a high broadcast value ensures that every node has sufficient access to information spreading in a network, thus quantifying fairness of access. In this paper, we formally study the Broadcast Improvement problem: given $G$ and a parameter $k$, the goal is to find the best set of $k$ edges to add to $G$ in order to maximize the broadcast value of the resulting graph. We develop efficient approximation algorithms for this problem. If the optimal solution adds $k$ edges and achieves a broadcast of $\beta^*$, we develop algorithms that can (a) add $k$ edges and achieve a broadcast value roughly $(\beta^*)^4/16^k$, or (b) add $O(k\log n)$ edges and achieve a broadcast roughly $\beta^*$. We also provide other trade-offs that can be better depending on the parameter values. Our algorithms rely on novel probabilistic tools to reason about the existence of paths in edge-sampled graphs, and extend to a single-source variant of the problem, where we obtain analogous algorithmic results. We complement our results by proving that unless P = NP, any algorithm that adds $O(k)$ edges must lose significantly in the approximation of $\beta^*$, resolving an open question from prior work.
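To make the definition of broadcast value concrete, here is a minimal Monte Carlo sketch. It assumes an undirected graph and an independent-cascade-style model in which every edge transmits independently with the same probability `p`; the abstract does not fix these details, so both are assumptions, and the function names are hypothetical. It estimates the minimum over node pairs of the probability that the two nodes are connected in an edge-sampled graph, and it does not implement the paper's approximation algorithms.

```python
import random
from itertools import combinations


def component_labels(n, edges):
    """Label each node 0..n-1 with a representative of its connected component."""
    adj = [[] for _ in range(n)]
    for u, v in edges:
        adj[u].append(v)
        adj[v].append(u)
    label = [-1] * n
    for start in range(n):
        if label[start] != -1:
            continue
        label[start] = start
        stack = [start]
        while stack:
            node = stack.pop()
            for nxt in adj[node]:
                if label[nxt] == -1:
                    label[nxt] = start
                    stack.append(nxt)
    return label


def estimate_broadcast(n, edges, p, trials=2000, seed=0):
    """Estimate min over pairs (u, v) of P[u reaches v] by edge sampling.

    Assumes n >= 2 and that each edge is kept independently with probability p.
    """
    rng = random.Random(seed)
    pairs = list(combinations(range(n), 2))
    hits = dict.fromkeys(pairs, 0)
    for _ in range(trials):
        live = [e for e in edges if rng.random() < p]
        label = component_labels(n, live)
        for u, v in pairs:
            if label[u] == label[v]:
                hits[(u, v)] += 1
    return min(count / trials for count in hits.values())


if __name__ == "__main__":
    path = [(0, 1), (1, 2), (2, 3)]
    print("path:", estimate_broadcast(4, path, 0.5))
    # One added edge (k = 1) closes the path into a cycle.
    print("path + (0, 3):", estimate_broadcast(4, path + [(0, 3)], 0.5))
```

Comparing the two estimates shows how a single added edge can raise the worst-case pairwise reachability, which is the quantity the Broadcast Improvement problem asks to maximize.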

Optimizer · Agent · Quasi-Newton method · Objective function · Networking

September 26, 2024

Distributed Quasi-Newton Method for Multi-Agent Optimization

Ola Shorinwa, Mac Schwager

We present a distributed quasi-Newton (DQN) method, which enables a group of agents to compute an optimal solution of a separable multi-agent optimization problem locally using an approximation of the curvature of the aggregate objective function. Each agent computes a descent direction from its local estimate of the aggregate Hessian, obtained from quasi-Newton approximation schemes using the gradient of its local objective function. Moreover, we introduce a distributed quasi-Newton method for equality-constrained optimization (EC-DQN), where each agent takes Karush-Kuhn-Tucker-like update steps to compute an optimal solution. In our algorithms, each agent communicates with its one-hop neighbors over a peer-to-peer communication network to compute a common solution. We prove convergence of our algorithms to a stationary point of the optimization problem. In addition, we demonstrate the competitive empirical convergence of our algorithm in both well-conditioned and ill-conditioned optimization problems, in terms of the computation time and communication cost incurred by each agent for convergence, compared to existing distributed first-order and second-order methods. Particularly, in ill-conditioned problems, our algorithms achieve a faster computation time for convergence, while requiring a lower communication cost, across a range of communication networks with different degrees of connectedness.

Graphics processing unit · Graph · Identifiable · Neural Networks · Networking

May 31, 2021

On Explainability of Graph Neural Networks via Subgraph Explorations

Hao Yuan, Haiyang Yu, Jie Wang, Kang Li, Shuiwang Ji

from arxiv, Accepted by ICML 2021

We consider the problem of explaining the predictions of graph neural networks (GNNs), which otherwise are considered as black boxes. Existing methods invariably focus on explaining the importance of graph nodes or edges but ignore the substructures of graphs, which are more intuitive and human-intelligible. In this work, we propose a novel method, known as SubgraphX, to explain GNNs by identifying important subgraphs. Given a trained GNN model and an input graph, our SubgraphX explains its predictions by efficiently exploring different subgraphs with Monte Carlo tree search. To make the tree search more effective, we propose to use Shapley values as a measure of subgraph importance, which can also capture the interactions among different subgraphs. To expedite computations, we propose efficient approximation schemes to compute Shapley values for graph data. Our work represents the first attempt to explain GNNs via identifying subgraphs explicitly and directly. Experimental results show that our SubgraphX achieves significantly improved explanations, while keeping computations at a reasonable level.
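For readers unfamiliar with Shapley values, the sketch below computes them exactly for a tiny cooperative game by averaging marginal contributions over all player orderings. The players stand in for graph substructures and `toy_value` is a hypothetical scoring function standing in for a model's output on a subgraph; neither comes from the paper, and SubgraphX itself relies on approximation schemes rather than this brute-force enumeration.

```python
from itertools import permutations


def shapley_values(players, value):
    """Exact Shapley values: average marginal contribution over all orderings."""
    totals = {p: 0.0 for p in players}
    orderings = list(permutations(players))
    for order in orderings:
        coalition = frozenset()
        for p in order:
            with_p = coalition | {p}
            totals[p] += value(with_p) - value(coalition)
            coalition = with_p
    return {p: t / len(orderings) for p, t in totals.items()}


def toy_value(coalition):
    """Hypothetical score: additive weights plus a bonus when 'a' and 'b' co-occur."""
    weights = {"a": 1.0, "b": 2.0, "c": 3.0}
    score = sum(weights[p] for p in coalition)
    if {"a", "b"} <= coalition:
        score += 4.0  # interaction term shared by 'a' and 'b'
    return score


if __name__ == "__main__":
    print(shapley_values(["a", "b", "c"], toy_value))
```

The interaction bonus is split evenly between `a` and `b`, which illustrates why Shapley values can capture interactions among parts rather than only their individual importance. The enumeration is factorial in the number of players, so it is only practical for very small games.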

Generalization theory · Extensibility · state-of-the-art · Test data · Learned

April 16, 2021

Deep Stable Learning for Out-Of-Distribution Generalization

Xingxuan Zhang, Peng Cui, Renzhe Xu, Linjun Zhou, Yue He, Zheyan Shen

Approaches based on deep neural networks have achieved striking performance when testing data and training data share a similar distribution, but can fail significantly otherwise. Therefore, eliminating the impact of distribution shifts between training and testing data is crucial for building performance-promising deep models. Conventional methods assume either the known heterogeneity of training data (e.g. domain labels) or the approximately equal capacities of different domains. In this paper, we consider a more challenging case where neither of the above assumptions holds. We propose to address this problem by removing the dependencies between features via learning weights for training samples, which helps deep models get rid of spurious correlations and, in turn, concentrate more on the true connection between discriminative features and labels. Extensive experiments on multiple distribution generalization benchmarks, including PACS, VLCS, MNIST-M, and NICO, demonstrate the effectiveness of our method compared with state-of-the-art counterparts.

entity · Neighborhood aggregation · Entity alignment · Graph · Networking

November 20, 2019

Knowledge Graph Alignment Network with Gated Multi-hop Neighborhood Aggregation

Zequn Sun, Chengming Wang, Wei Hu, Muhao Chen, Jian Dai, Wei Zhang, Yuzhong Qu

from arxiv, Accepted by the 34th AAAI Conference on Artificial Intelligence (AAAI 2020)

Graph neural networks (GNNs) have emerged as a powerful paradigm for embedding-based entity alignment due to their capability of identifying isomorphic subgraphs. However, in real knowledge graphs (KGs), the counterpart entities usually have non-isomorphic neighborhood structures, which easily causes GNNs to yield different representations for them. To tackle this problem, we propose a new KG alignment network, namely AliNet, which aims to mitigate the non-isomorphism of neighborhood structures in an end-to-end manner. As the direct neighbors of counterpart entities are usually dissimilar due to schema heterogeneity, AliNet introduces distant neighbors to expand the overlap between their neighborhood structures. It employs an attention mechanism to highlight helpful distant neighbors and reduce noise. Then, it controls the aggregation of both direct and distant neighborhood information using a gating mechanism. We further propose a relation loss to refine entity representations. We perform thorough experiments with detailed ablation studies and analyses on five entity alignment datasets, demonstrating the effectiveness of AliNet.

entity · Graph · Knowledge graph · MoDELS · Similarity

September 11, 2019

Domain Representation for Knowledge Graph Embedding

Cunxiang Wang, Feiliang Ren, Zhichao Lin, Chenxv Zhao, Tian Xie, Yue Zhang

from arxiv, Accepted by NLPCC 2019

Embedding entities and relations into a continuous multi-dimensional vector space has become the dominant method for knowledge graph embedding in representation learning. However, most existing models fail to represent hierarchical knowledge, such as the similarities and dissimilarities of entities in one domain. We propose to learn domain representations on top of existing knowledge graph embedding models, such that entities with similar attributes are organized into the same domain. Such hierarchical knowledge of domains can provide further evidence in link prediction. Experimental results show that domain embeddings give a significant improvement over the most recent state-of-the-art baseline knowledge graph embedding models.

Discretization · Graph · Graphics processing unit · Neural Networks · Networking

March 28, 2019

Learning Discrete Structures for Graph Neural Networks

Luca Franceschi, Mathias Niepert, Massimiliano Pontil, Xiao He

from arxiv, 18 pages

Graph neural networks (GNNs) are a popular class of machine learning models whose major advantage is their ability to incorporate a sparse and discrete dependency structure between data points. Unfortunately, GNNs can only be used when such a graph-structure is available. In practice, however, real-world graphs are often noisy and incomplete or might not be available at all. With this work, we propose to jointly learn the graph structure and the parameters of graph convolutional networks (GCNs) by approximately solving a bilevel program that learns a discrete probability distribution on the edges of the graph. This allows one to apply GCNs not only in scenarios where the given graph is incomplete or corrupted but also in those where a graph is not available. We conduct a series of experiments that analyze the behavior of the proposed method and demonstrate that it outperforms related methods by a significant margin.

Smoothing · Attention mechanism · Backpropagation · Viterbi algorithm · Regularization term

February 20, 2018

Differentiable Dynamic Programming for Structured Prediction and Attention

Arthur Mensch, Mathieu Blondel

Dynamic programming (DP) solves a variety of structured combinatorial problems by iteratively breaking them down into smaller subproblems. In spite of their versatility, DP algorithms are usually non-differentiable, which hampers their use as a layer in neural networks trained by backpropagation. To address this issue, we propose to smooth the max operator in the dynamic programming recursion, using a strongly convex regularizer. This allows us to relax both the optimal value and the solution of the original combinatorial problem, and turns a broad class of DP algorithms into differentiable operators. Theoretically, we provide a new probabilistic perspective on backpropagating through these DP operators, and relate them to inference in graphical models. We derive two particular instantiations of our framework, a smoothed Viterbi algorithm for sequence prediction and a smoothed DTW algorithm for time-series alignment. We showcase these instantiations on two structured prediction tasks and on structured and sparse attention for neural machine translation.
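The idea of smoothing the max operator can be sketched for one common choice of strongly convex regularizer, the negative entropy, which yields a scaled log-sum-exp whose gradient is a softmax. The function names and the temperature parameter `gamma` below are chosen for this illustration, and the snippet shows only the operator, not the full smoothed Viterbi or DTW recursions from the paper.

```python
import math


def smoothed_max(xs, gamma):
    """Entropy-regularized max: gamma * log(sum(exp(x / gamma))).

    Computed with the usual max-shift for numerical stability.
    """
    m = max(xs)
    return m + gamma * math.log(sum(math.exp((x - m) / gamma) for x in xs))


def smoothed_argmax(xs, gamma):
    """Gradient of smoothed_max with respect to xs: a softmax with temperature gamma."""
    m = max(xs)
    weights = [math.exp((x - m) / gamma) for x in xs]
    total = sum(weights)
    return [w / total for w in weights]


if __name__ == "__main__":
    scores = [1.0, 2.0, 3.0]
    for gamma in (1.0, 0.1, 0.01):
        print(gamma, smoothed_max(scores, gamma), smoothed_argmax(scores, gamma))
```

As `gamma` shrinks, the smoothed value approaches the hard maximum and the gradient approaches a one-hot indicator of the maximizer, while for any positive `gamma` the operator stays differentiable everywhere. Replacing each hard max in a DP recursion with such an operator is what makes the recursion usable inside backpropagation.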

Question answering · MoDELS · Networking · Processing (programming language) · state-of-the-art

January 15, 2018

An Interpretable Reasoning Network for Multi-Relation Question Answering

Mantong Zhou, Minlie Huang, Xiaoyan Zhu

Multi-relation Question Answering is a challenging task because it requires elaborate analysis of questions and reasoning over multiple fact triples in a knowledge base. In this paper, we present a novel model called Interpretable Reasoning Network that employs an interpretable, hop-by-hop reasoning process for question answering. The model dynamically decides which part of an input question should be analyzed at each hop; predicts a relation that corresponds to the current parsed results; utilizes the predicted relation to update the question representation and the state of the reasoning process; and then drives the next-hop reasoning. Experiments show that our model yields state-of-the-art results on two datasets. More interestingly, the model can offer traceable and observable intermediate predictions for reasoning analysis and failure diagnosis.