亚州AV无码专区在线电影_黄色网站一级二级三级视频_国产精品午夜福利鲁丝片在线_九九RE久久这里有精品_影音先锋在线视频_国内精品久久久久影院大陆_国产亚洲综合在线区尤物

Memory performance is a bottleneck in graph analytics acceleration. Existing Machine Learning (ML) prefetchers struggle with phase transitions and irregular memory accesses in graph processing. We propose MPGraph, an ML-based Prefetcher for Graph analytics using domain specific models. MPGraph introduces three novel optimizations: soft detection for phase transitions, phase-specific multi-modality models for access delta and page predictions, and chain spatio-temporal prefetching (CSTP) for prefetch control. Our transition detector achieves 34.17-82.15% higher precision compared with Kolmogorov-Smirnov Windowing and decision tree. Our predictors achieve 6.80-16.02% higher F1-score for delta and 11.68-15.41% higher accuracy-at-10 for page prediction compared with LSTM and vanilla attention models. Using CSTP, MPGraph achieves 12.52-21.23% IPC improvement, outperforming state-of-the-art non-ML prefetcher BO by 7.58-12.03% and ML-based prefetchers Voyager and TransFetch by 3.27-4.58%. For practical implementation, we demonstrate MPGraph using compressed models with reduced latency shows significantly superior accuracy and coverage compared with BO, leading to 3.58% higher IPC improvement.

相關內容

圖

關注 6

Continuity · Performer · 機器人 · 控制器 · Legged Robot ·

2023 年 11 月 8 日

RAMIEL: A Parallel-Wire Driven Monopedal Robot for High and Continuous Jumping

Temma Suzuki,Yasunori Toshimitsu,Yuya Nagamatsu,Kento Kawaharazuka,Akihiro Miki,Yoshimoto Ribayashi,Masahiro Bando,Kunio Kojima,Yohei Kakiuchi,Kei Okada,Masayuki Inaba

from arxiv, accepted at IROS2022 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2022, website - //tenrobo18.github.io/ramiel-iros2022/

Legged robots with high locomotive performance have been extensively studied, and various leg structures have been proposed. Especially, a leg structure that can achieve both continuous and high jumps is advantageous for moving around in a three-dimensional environment. In this study, we propose a parallel wire-driven leg structure, which has one DoF of linear motion and two DoFs of rotation and is controlled by six wires, as a structure that can achieve both continuous jumping and high jumping. The proposed structure can simultaneously achieve high controllability on each DoF, long acceleration distance and high power required for jumping. In order to verify the jumping performance of the parallel wire-driven leg structure, we have developed a parallel wire-driven monopedal robot, RAMIEL. RAMIEL is equipped with quasi-direct drive, high power wire winding mechanisms and a lightweight leg, and can achieve a maximum jumping height of 1.6 m and a maximum of seven continuous jumps.

容差 · MoDELS · DNN · state-of-the-art · Institute of Deep Learning ·

2023 年 11 月 7 日

Oobleck: Resilient Distributed Training of Large Models Using Pipeline Templates

Insu Jang,Zhenning Yang,Zhen Zhang,Xin Jin,Mosharaf Chowdhury

from arxiv, SOSP'23 | Camera-ready + figures and numbers are corrected

Oobleck enables resilient distributed training of large DNN models with guaranteed fault tolerance. It takes a planning-execution co-design approach, where it first generates a set of heterogeneous pipeline templates and instantiates at least $f+1$ logically equivalent pipeline replicas to tolerate any $f$ simultaneous failures. During execution, it relies on already-replicated model states across the replicas to provide fast recovery. Oobleck provably guarantees that some combination of the initially created pipeline templates can be used to cover all available resources after $f$ or fewer simultaneous failures, thereby avoiding resource idling at all times. Evaluation on large DNN models with billions of parameters shows that Oobleck provides consistently high throughput, and it outperforms state-of-the-art fault tolerance solutions like Bamboo and Varuna by up to $29.6x$.

Learning · 圖 · 結構化學習 · Processing（編程語言） · MoDELS ·

2023 年 11 月 7 日

OpenGSL: A Comprehensive Benchmark for Graph Structure Learning

Zhiyao Zhou,Sheng Zhou,Bochao Mao,Xuanyi Zhou,Jiawei Chen,Qiaoyu Tan,Daochen Zha,Yan Feng,Chun Chen,Can Wang

from arxiv, 9 pages, 4 figures. Accepted by NeurIPS Datasets and Benchmarks Track 2023

Graph Neural Networks (GNNs) have emerged as the de facto standard for representation learning on graphs, owing to their ability to effectively integrate graph topology and node attributes. However, the inherent suboptimal nature of node connections, resulting from the complex and contingent formation process of graphs, presents significant challenges in modeling them effectively. To tackle this issue, Graph Structure Learning (GSL), a family of data-centric learning approaches, has garnered substantial attention in recent years. The core concept behind GSL is to jointly optimize the graph structure and the corresponding GNN models. Despite the proposal of numerous GSL methods, the progress in this field remains unclear due to inconsistent experimental protocols, including variations in datasets, data processing techniques, and splitting strategies. In this paper, we introduce OpenGSL, the first comprehensive benchmark for GSL, aimed at addressing this gap. OpenGSL enables a fair comparison among state-of-the-art GSL methods by evaluating them across various popular datasets using uniform data processing and splitting strategies. Through extensive experiments, we observe that existing GSL methods do not consistently outperform vanilla GNN counterparts. We also find that there is no significant correlation between the homophily of the learned structure and task performance, challenging the common belief. Moreover, we observe that the learned graph structure demonstrates a strong generalization ability across different GNN models, despite the high computational and space consumption. We hope that our open-sourced library will facilitate rapid and equitable evaluation and inspire further innovative research in this field. The code of the benchmark can be found in //github.com/OpenGSL/OpenGSL.

近似 · 線性的 · 塊 · 操作 · Continuity ·

2023 年 11 月 6 日

Splitting Schemes for Coupled Differential Equations: Block Schur-Based Approaches and Partial Jacobi Approximation

Roberto Nuca,Erlend Storvik,Florin A. Radu,Matteo Icardi

Coupled multi-physics problems are encountered in countless applications and pose significant numerical challenges. Although monolithic approaches offer possibly the best solution strategy, they often require ad-hoc preconditioners and numerical implementations. Sequential (also known as splitted, partitioned or segregated) approaches are iterative methods for solving coupled problems where each equation is solved independently and the coupling is achieved through iterations. These methods offer the possibility to flexibly add or remove equations from a model and to rely on existing black-box solvers for every specific equation. Furthermore, when problems are non-linear, inner iterations need to be performed even in monolithic solvers, therefore making a sequential iterative approach a viable alternative. The cost of running inner iterations to achieve the coupling, however, could easily becomes prohibitive, or, in some cases the iterations might not converge. In this work we present a general formulation of splitting schemes for continuous operators, with arbitrary implicit/explicit splitting, like in standard iterative methods for linear systems. By introducing a generic relaxation operator we find the conditions for the convergence of the iterative schemes. We show how the relaxation operator can be thought as a preconditioner and constructed based on an approximate Schur-complement. We propose a Schur-based Partial Jacobi relaxation operator to stabilise the coupling and show its effectiveness. Although we mainly focus on scalar-scalar linear problems, most results are easily extended to non-linear and higher-dimensional problems. Numerical tests (1D and 2D) for two PDE systems, namely the Dual-Porosity model and a Quad-Laplacian operator, are carried out to confirm the theoretical results.

IP · 可辨認的 · 評論員 · 真實值 · 可理解性 ·

2023 年 11 月 6 日

Nautilus: A Framework for Cross-Layer Cartography of Submarine Cables and IP Links

Alagappan Ramanathan,Sangeetha Abdu Jyothi

Submarine cables constitute the backbone of the Internet. However, these critical infrastructure components are vulnerable to several natural and man-made threats, and during failures, are difficult to repair in their remote oceanic environments. In spite of their crucial role, we have a limited understanding of the impact of submarine cable failures on global connectivity, particularly on the higher layers of the Internet. In this paper, we present Nautilus, a framework for cross-layer cartography of submarine cables and IP links. Using a corpus of public datasets and Internet cartographic techniques, Nautilus identifies IP links that are likely traversing submarine cables and maps them to one or more potential cables. Nautilus also gives each IP to cable assignment a prediction score that reflects the confidence in the mapping. Nautilus generates a mapping for 3.05 million and 1.43 million IPv4 and IPv6 links respectively, covering 91% of all active cables. In the absence of ground truth data, we validate Nautilus mapping using three techniques: analyzing past cable failures, using targeted traceroute measurements, and comparing with public network maps of two operators.

情景 · MoDELS · Processing（編程語言） · 離散化 · 泛函 ·

2023 年 11 月 4 日

Multilevel Diffusion: Infinite Dimensional Score-Based Diffusion Models for Image Generation

Paul Hagemann,Sophie Mildenberger,Lars Ruthotto,Gabriele Steidl,Nicole Tianjiao Yang

Score-based diffusion models (SBDM) have recently emerged as state-of-the-art approaches for image generation. Existing SBDMs are typically formulated in a finite-dimensional setting, where images are considered as tensors of finite size. This paper develops SBDMs in the infinite-dimensional setting, that is, we model the training data as functions supported on a rectangular domain. Besides the quest for generating images at ever higher resolution, our primary motivation is to create a well-posed infinite-dimensional learning problem so that we can discretize it consistently on multiple resolution levels. We thereby intend to obtain diffusion models that generalize across different resolution levels and improve the efficiency of the training process. We demonstrate how to overcome two shortcomings of current SBDM approaches in the infinite-dimensional setting. First, we modify the forward process to ensure that the latent distribution is well-defined in the infinite-dimensional setting using the notion of trace class operators. We derive the reverse processes for finite approximations. Second, we illustrate that approximating the score function with an operator network is beneficial for multilevel training. After deriving the convergence of the discretization and the approximation of multilevel training, we implement an infinite-dimensional SBDM approach and show the first promising results on MNIST and Fashion-MNIST, underlining our developed theory.

模型評估 · 圖 · FAST · 劃分 · 縮放 ·

2023 年 11 月 4 日

Entropy Aware Training for Fast and Accurate Distributed GNN

Dhruv Deshmukh,Gagan Raj Gupta,Manisha Chawla,Vishwesh Jatala,Anirban Haldar

from arxiv, 8 pages, 3 figures, 5 tables, accepted at ICDM'23

Several distributed frameworks have been developed to scale Graph Neural Networks (GNNs) on billion-size graphs. On several benchmarks, we observe that the graph partitions generated by these frameworks have heterogeneous data distributions and class imbalance, affecting convergence, and resulting in lower performance than centralized implementations. We holistically address these challenges and develop techniques that reduce training time and improve accuracy. We develop an Edge-Weighted partitioning technique to improve the micro average F1 score (accuracy) by minimizing the total entropy. Furthermore, we add an asynchronous personalization phase that adapts each compute-host's model to its local data distribution. We design a class-balanced sampler that considerably speeds up convergence. We implemented our algorithms on the DistDGL framework and observed that our training techniques scale much better than the existing training approach. We achieved a (2-3x) speedup in training time and 4\% improvement on average in micro-F1 scores on 5 large graph benchmarks compared to the standard baselines.

Learning · 可約的 · 機器人 · Seven · HTTPS ·

2023 年 11 月 3 日

HYDRA: Hybrid Robot Actions for Imitation Learning

Suneel Belkhale,Yuchen Cui,Dorsa Sadigh

Imitation Learning (IL) is a sample efficient paradigm for robot learning using expert demonstrations. However, policies learned through IL suffer from state distribution shift at test time, due to compounding errors in action prediction which lead to previously unseen states. Choosing an action representation for the policy that minimizes this distribution shift is critical in imitation learning. Prior work propose using temporal action abstractions to reduce compounding errors, but they often sacrifice policy dexterity or require domain-specific knowledge. To address these trade-offs, we introduce HYDRA, a method that leverages a hybrid action space with two levels of action abstractions: sparse high-level waypoints and dense low-level actions. HYDRA dynamically switches between action abstractions at test time to enable both coarse and fine-grained control of a robot. In addition, HYDRA employs action relabeling to increase the consistency of actions in the dataset, further reducing distribution shift. HYDRA outperforms prior imitation learning methods by 30-40% on seven challenging simulation and real world environments, involving long-horizon tasks in the real world like making coffee and toasting bread. Videos are found on our website: //tinyurl.com/3mc6793z

Pyramid · MoDELS · Extensibility · state-of-the-art · Performer ·

2022 年 12 月 1 日

Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis

Wan-Cyuan Fan,Yen-Chun Chen,Dongdong Chen,Yu Cheng,Lu Yuan,Yu-Chiang Frank Wang

from arxiv, AAAI 2023

Diffusion models (DMs) have shown great potential for high-quality image synthesis. However, when it comes to producing images with complex scenes, how to properly describe both image global structures and object details remains a challenging task. In this paper, we present Frido, a Feature Pyramid Diffusion model performing a multi-scale coarse-to-fine denoising process for image synthesis. Our model decomposes an input image into scale-dependent vector quantized features, followed by a coarse-to-fine gating for producing image output. During the above multi-scale representation learning stage, additional input conditions like text, scene graph, or image layout can be further exploited. Thus, Frido can be also applied for conditional or cross-modality image synthesis. We conduct extensive experiments over various unconditioned and conditional image generation tasks, ranging from text-to-image synthesis, layout-to-image, scene-graph-to-image, to label-to-image. More specifically, we achieved state-of-the-art FID scores on five benchmarks, namely layout-to-image on COCO and OpenImages, scene-graph-to-image on COCO and Visual Genome, and label-to-image on COCO. Code is available at //github.com/davidhalladay/Frido.

圖 · 結構化學習 · 穩健性 · 學成 · GNN ·

2021 年 3 月 4 日

Deep Graph Structure Learning for Robust Representations: A Survey

Yanqiao Zhu,Weizhi Xu,Jinghao Zhang,Qiang Liu,Shu Wu,Liang Wang

from arxiv, 8 pages, in submission to IJCAI 2021 (Survey Track)

Graph Neural Networks (GNNs) are widely used for analyzing graph-structured data. Most GNN methods are highly sensitive to the quality of graph structures and usually require a perfect graph structure for learning informative embeddings. However, the pervasiveness of noise in graphs necessitates learning robust representations for real-world problems. To improve the robustness of GNN models, many studies have been proposed around the central concept of Graph Structure Learning (GSL), which aims to jointly learn an optimized graph structure and corresponding representations. Towards this end, in the presented survey, we broadly review recent progress of GSL methods for learning robust representations. Specifically, we first formulate a general paradigm of GSL, and then review state-of-the-art methods classified by how they model graph structures, followed by applications that incorporate the idea of GSL in other graph tasks. Finally, we point out some issues in current studies and discuss future directions.