丰满人妻被公侵犯高清版_国产精品午夜福利鲁丝片在线_日韩有码中文字幕欧美精品一区二区三区_美女一级视频久久_最新嫖妓视频在线观看_欧美精品中文字幕第一页_影音先锋在线在线资源网站

Time series forecasting holds significant importance in many real-world dynamic systems and has been extensively studied. Unlike natural language process (NLP) and computer vision (CV), where a single large model can tackle multiple tasks, models for time series forecasting are often specialized, necessitating distinct designs for different tasks and applications. While pre-trained foundation models have made impressive strides in NLP and CV, their development in time series domains has been constrained by data sparsity. Recent studies have revealed that large language models (LLMs) possess robust pattern recognition and reasoning abilities over complex sequences of tokens. However, the challenge remains in effectively aligning the modalities of time series data and natural language to leverage these capabilities. In this work, we present Time-LLM, a reprogramming framework to repurpose LLMs for general time series forecasting with the backbone language models kept intact. We begin by reprogramming the input time series with text prototypes before feeding it into the frozen LLM to align the two modalities. To augment the LLM's ability to reason with time series data, we propose Prompt-as-Prefix (PaP), which enriches the input context and directs the transformation of reprogrammed input patches. The transformed time series patches from the LLM are finally projected to obtain the forecasts. Our comprehensive evaluations demonstrate that Time-LLM is a powerful time series learner that outperforms state-of-the-art, specialized forecasting models. Moreover, Time-LLM excels in both few-shot and zero-shot learning scenarios.

相關內容

大語言模型(xing)

關注 56

大(da)語(yu)言(yan)(yan)(yan)模(mo)型(xing)是(shi)基(ji)于海量(liang)文(wen)本(ben)數(shu)據訓(xun)練的(de)(de)(de)深(shen)度學習模(mo)型(xing)。它(ta)不(bu)僅(jin)能(neng)(neng)夠生成自(zi)然(ran)語(yu)言(yan)(yan)(yan)文(wen)本(ben)，還(huan)能(neng)(neng)夠深(shen)入理解(jie)文(wen)本(ben)含(han)義，處理各種(zhong)自(zi)然(ran)語(yu)言(yan)(yan)(yan)任務(wu)，如文(wen)本(ben)摘(zhai)要、問答(da)、翻譯等(deng)。2023年，大(da)語(yu)言(yan)(yan)(yan)模(mo)型(xing)及(ji)(ji)其(qi)在人(ren)工智能(neng)(neng)領域的(de)(de)(de)應(ying)用已成為全球科技(ji)研究的(de)(de)(de)熱點，其(qi)在規模(mo)上的(de)(de)(de)增長尤為引人(ren)注目，參(can)數(shu)量(liang)已從最初的(de)(de)(de)十幾億(yi)躍升到如今的(de)(de)(de)一(yi)萬億(yi)。參(can)數(shu)量(liang)的(de)(de)(de)提(ti)升使得(de)模(mo)型(xing)能(neng)(neng)夠更(geng)加精細(xi)地捕捉(zhuo)人(ren)類語(yu)言(yan)(yan)(yan)微(wei)妙(miao)之處，更(geng)加深(shen)入地理解(jie)人(ren)類語(yu)言(yan)(yan)(yan)的(de)(de)(de)復雜性(xing)。在過去的(de)(de)(de)一(yi)年里(li)，大(da)語(yu)言(yan)(yan)(yan)模(mo)型(xing)在吸納新(xin)知識、分解(jie)復雜任務(wu)以及(ji)(ji)圖文(wen)對齊等(deng)多方面都有顯著(zhu)提(ti)升。隨著(zhu)技(ji)術的(de)(de)(de)不(bu)斷成熟，它(ta)將不(bu)斷拓展其(qi)應(ying)用范圍，為人(ren)類提(ti)供(gong)更(geng)加智能(neng)(neng)化和個(ge)性(xing)化的(de)(de)(de)服務(wu)，進(jin)一(yi)步(bu)改(gai)善人(ren)們(men)的(de)(de)(de)生活和生產方式。

Attention · DCN · Learning · Softmax · 估計/估計量 ·

2024 年 3 月 11 日

LVC-LGMC: Joint Local and Global Motion Compensation for Learned Video Compression

Wei Jiang,Junru Li,Kai Zhang,Li Zhang

from arxiv, Accepted to ICASSP 2024 (lecture presentation). The first attempt to use cross attention for bits-free motion estimation and motion compensation

Existing learned video compression models employ flow net or deformable convolutional networks (DCN) to estimate motion information. However, the limited receptive fields of flow net and DCN inherently direct their attentiveness towards the local contexts. Global contexts, such as large-scale motions and global correlations among frames are ignored, presenting a significant bottleneck for capturing accurate motions. To address this issue, we propose a joint local and global motion compensation module (LGMC) for leaned video coding. More specifically, we adopt flow net for local motion compensation. To capture global context, we employ the cross attention in feature domain for motion compensation. In addition, to avoid the quadratic complexity of vanilla cross attention, we divide the softmax operations in attention into two independent softmax operations, leading to linear complexity. To validate the effectiveness of our proposed LGMC, we integrate it with DCVC-TCM and obtain learned video compression with joint local and global motion compensation (LVC-LGMC). Extensive experiments demonstrate that our LVC-LGMC has significant rate-distortion performance improvements over baseline DCVC-TCM.

通道 · TransAct · Extensibility · 規范化的 · state-of-the-art ·

2024 年 3 月 10 日

ABC-Channel: An Advanced Blockchain-based Covert Channel

Xiaobo Ma,Pengyu Pan,Jianfeng Li,Wei Wang,Weizhi Meng,Xiaohong Guan

Establishing efficient and robust covert channels is crucial for secure communication within insecure network environments. With its inherent benefits of decentralization and anonymization, blockchain has gained considerable attention in developing covert channels. To guarantee a highly secure covert channel, channel negotiation should be contactless before the communication, carrier transaction features must be indistinguishable from normal transactions during the communication, and communication identities must be untraceable after the communication. Such a full-lifecycle covert channel is indispensable to defend against a versatile adversary who intercepts two communicating parties comprehensively (e.g., on-chain and off-chain). Unfortunately, it has not been thoroughly investigated in the literature. We make the first effort to achieve a full-lifecycle covert channel, a novel blockchain-based covert channel named ABC-Channel. We tackle a series of challenges, such as off-chain contact dependency, increased masquerading difficulties as growing transaction volume, and time-evolving, communicable yet untraceable identities, to achieve contactless channel negotiation, indistinguishable transaction features, and untraceable communication identities, respectively. We develop a working prototype to validate ABC-Channel and conduct extensive tests on the Bitcoin testnet. The experimental results demonstrate that ABC-Channel achieves substantially secure covert capabilities. In comparison to existing methods, it also exhibits state-of-the-art transmission efficiency.

MIMO · 自動問答 · state-of-the-art · Wireless Networks · 極大似然 ·

2024 年 3 月 9 日

X-ResQ: Reverse Annealing for Quantum MIMO Detection with Flexible Parallelism

Minsung Kim,Abhishek Kumar Singh,Davide Venturelli,John Kaewell,Kyle Jamieson

from arxiv, 22 pages

Quantum Annealing (QA)-accelerated MIMO detection is an emerging research approach in the context of NextG wireless networks. The opportunity is to enable large MIMO systems and thus improve wireless performance. The approach aims to leverage QA to expedite the computation required for theoretically optimal but computationally-demanding Maximum Likelihood detection to overcome the limitations of the currently deployed linear detectors. This paper presents X-ResQ, a QA-based MIMO detector system featuring fine-grained quantum task parallelism that is uniquely enabled by the Reverse Annealing (RA) protocol. Unlike prior designs, X-ResQ has many desirable system properties for a parallel QA detector and has effectively improved detection performance as more qubits are assigned. In our evaluations on a state-of-the-art quantum annealer, fully parallel X-ResQ achieves near-optimal throughput (over 10 bits/s/Hz) for $4\times6$ MIMO with 16-QAM using six levels of parallelism with 240 qubits and $220~\mu$s QA compute time, achieving 2.5--5$\times$ gains compared against other tested detectors. For more comprehensive evaluations, we implement and evaluate X-ResQ in the non-quantum digital setting. This non-quantum X-ResQ demonstration showcases the potential to realize ultra-large $1024\times1024$ MIMO, significantly outperforming other MIMO detectors, including the state-of-the-art RA detector classically implemented in the same way.

圖 · 估計/估計量 · Extensibility · Facebook AI Research · 近似 ·

2024 年 3 月 8 日

Precedence-Constrained Winter Value for Effective Graph Data Valuation

Hongliang Chi,Wei Jin,Charu Aggarwal,Yao Ma

from arxiv, 17 pages in total

Data valuation is essential for quantifying data's worth, aiding in assessing data quality and determining fair compensation. While existing data valuation methods have proven effective in evaluating the value of Euclidean data, they face limitations when applied to the increasingly popular graph-structured data. Particularly, graph data valuation introduces unique challenges, primarily stemming from the intricate dependencies among nodes and the exponential growth in value estimation costs. To address the challenging problem of graph data valuation, we put forth an innovative solution, Precedence-Constrained Winter (PC-Winter) Value, to account for the complex graph structure. Furthermore, we develop a variety of strategies to address the computational challenges and enable efficient approximation of PC-Winter. Extensive experiments demonstrate the effectiveness of PC-Winter across diverse datasets and tasks.

回合 · 優化器 · 估計/估計量 · Markov · Processing（編程語言） ·

2024 年 3 月 7 日

Weathering Ongoing Uncertainty: Learning and Planning in a Time-Varying Partially Observable Environment

Gokul Puthumanaillam,Xiangyu Liu,Negar Mehr,Melkior Ornik

from arxiv, Page 3, fixed typo

Optimal decision-making presents a significant challenge for autonomous systems operating in uncertain, stochastic and time-varying environments. Environmental variability over time can significantly impact the system's optimal decision making strategy for mission completion. To model such environments, our work combines the previous notion of Time-Varying Markov Decision Processes (TVMDP) with partial observability and introduces Time-Varying Partially Observable Markov Decision Processes (TV-POMDP). We propose a two-pronged approach to accurately estimate and plan within the TV-POMDP: 1) Memory Prioritized State Estimation (MPSE), which leverages weighted memory to provide more accurate time-varying transition estimates; and 2) an MPSE-integrated planning strategy that optimizes long-term rewards while accounting for temporal constraint. We validate the proposed framework and algorithms using simulations and hardware, with robots exploring a partially observable, time-varying environments. Our results demonstrate superior performance over standard methods, highlighting the framework's effectiveness in stochastic, uncertain, time-varying domains.

INTERACT · 機器人 · 正則的 · Performance · MoDELS ·

2024 年 3 月 7 日

A General Calibrated Regret Metric for Detecting and Mitigating Human-Robot Interaction Failures

Kensuke Nakamura,Ran Tian,Andrea Bajcsy

from arxiv, 5 figures, 3 tables

Robot decision-making increasingly relies on expressive data-driven human prediction models when operating around people. While these models are known to suffer from prediction errors in out-of-distribution interactions, not all prediction errors equally impact downstream robot performance. We identify that the mathematical notion of regret precisely characterizes the degree to which incorrect predictions of future interaction outcomes degraded closed-loop robot performance. However, canonical regret measures are poorly calibrated across diverse deployment interactions. We extend the canonical notion of regret by deriving a calibrated regret metric that generalizes from absolute reward space to probability space. With this transformation, our metric removes the need for explicit reward functions to calculate the robot's regret, enables fairer comparison of interaction anomalies across disparate deployment contexts, and facilitates targetted dataset construction of "system- level" prediction failures. We experimentally quantify the value of this high-regret interaction data for aiding the robot in improving its downstream decision-making. In a suite of closed- loop autonomous driving simulations, we find that fine-tuning ego-conditioned behavior predictors exclusively on high-regret human-robot interaction data can improve the robot's overall re-deployment performance with significantly (77%) less data.

流 · Networking · MoDELS · 輸出 · 詞元分析器 ·

2024 年 3 月 7 日

ALTO: An Efficient Network Orchestrator for Compound AI Systems

Keshav Santhanam,Deepti Raghavan,Muhammad Shahir Rahman,Thejas Venkatesh,Neha Kunjal,Pratiksha Thaker,Philip Levis,Matei Zaharia

We present ALTO, a network orchestrator for efficiently serving compound AI systems such as pipelines of language models. ALTO achieves high throughput and low latency by taking advantage of an optimization opportunity specific to generative language models: streaming intermediate outputs. As language models produce outputs token by token, ALTO exposes opportunities to stream intermediate outputs between stages when possible. We highlight two new challenges of correctness and load balancing which emerge when streaming intermediate data across distributed pipeline stage instances. We also motivate the need for an aggregation-aware routing interface and distributed prompt-aware scheduling to address these challenges. We demonstrate the impact of ALTO's partial output streaming on a complex chatbot verification pipeline, increasing throughput by up to 3x for a fixed latency target of 4 seconds / request while also reducing tail latency by 1.8x compared to a baseline serving approach.

通道 · 優化器 · Networking · Neural Networks · ML ·

2024 年 3 月 6 日

RISnet: A Domain-Knowledge Driven Neural Network Architecture for RIS Optimization with Mutual Coupling and Partial CSI

Bile Peng,Karl-Ludwig Besser,Shanpu Shen,Finn Siegismund-Poschmann,Ramprasad Raghunath,Daniel Mittleman,Vahid Jamali,Eduard A. Jorswieck

from arxiv, 13 pages, 16 figures

Multiple access techniques are cornerstones of wireless communications. Their performance depends on the channel properties, which can be improved by reconfigurable intelligent surfaces (RISs). In this work, we jointly optimize MA precoding at the base station (BS) and RIS configuration. We tackle difficulties of mutual coupling between RIS elements, scalability to more than 1000 RIS elements, and channel estimation. We first derive an RIS-assisted channel model considering mutual coupling, then propose an unsupervised machine learning (ML) approach to optimize the RIS. In particular, we design a dedicated neural network (NN) architecture RISnet with good scalability and desired symmetry. Moreover, we combine ML-enabled RIS configuration and analytical precoding at BS since there exist analytical precoding schemes. Furthermore, we propose another variant of RISnet, which requires the channel state information (CSI) of a small portion of RIS elements (in this work, 16 out of 1296 elements) if the channel comprises a few specular propagation paths. More generally, this work is an early contribution to combine ML technique and domain knowledge in communication for NN architecture design. Compared to generic ML, the problem-specific ML can achieve higher performance, lower complexity and symmetry.

多峰值 · 學成 · 知識 (knowledge) · Performer · 講稿 ·

2022 年 5 月 3 日

Deep Learning in Multimodal Remote Sensing Data Fusion: A Comprehensive Review

Jiaxin Li,Danfeng Hong,Lianru Gao,Jing Yao,Ke Zheng,Bing Zhang,Jocelyn Chanussot

With the extremely rapid advances in remote sensing (RS) technology, a great quantity of Earth observation (EO) data featuring considerable and complicated heterogeneity is readily available nowadays, which renders researchers an opportunity to tackle current geoscience applications in a fresh way. With the joint utilization of EO data, much research on multimodal RS data fusion has made tremendous progress in recent years, yet these developed traditional algorithms inevitably meet the performance bottleneck due to the lack of the ability to comprehensively analyse and interpret these strongly heterogeneous data. Hence, this non-negligible limitation further arouses an intense demand for an alternative tool with powerful processing competence. Deep learning (DL), as a cutting-edge technology, has witnessed remarkable breakthroughs in numerous computer vision tasks owing to its impressive ability in data representation and reconstruction. Naturally, it has been successfully applied to the field of multimodal RS data fusion, yielding great improvement compared with traditional methods. This survey aims to present a systematic overview in DL-based multimodal RS data fusion. More specifically, some essential knowledge about this topic is first given. Subsequently, a literature survey is conducted to analyse the trends of this field. Some prevalent sub-fields in the multimodal RS data fusion are then reviewed in terms of the to-be-fused data modalities, i.e., spatiospectral, spatiotemporal, light detection and ranging-optical, synthetic aperture radar-optical, and RS-Geospatial Big Data fusion. Furthermore, We collect and summarize some valuable resources for the sake of the development in multimodal RS data fusion. Finally, the remaining challenges and potential future directions are highlighted.

變換 · Extensibility · INFORMS · Performer · MoDELS ·

2020 年 12 月 17 日

Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting

Haoyi Zhou,Shanghang Zhang,Jieqi Peng,Shuai Zhang,Jianxin Li,Hui Xiong,Wancai Zhang

from arxiv, 7 pages (main), 5 pages (appendix) and to be appeared in AAAI2021

Many real-world applications require the prediction of long sequence time-series, such as electricity consumption planning. Long sequence time-series forecasting (LSTF) demands a high prediction capacity of the model, which is the ability to capture precise long-range dependency coupling between output and input efficiently. Recent studies have shown the potential of Transformer to increase the prediction capacity. However, there are several severe issues with Transformer that prevent it from being directly applicable to LSTF, such as quadratic time complexity, high memory usage, and inherent limitation of the encoder-decoder architecture. To address these issues, we design an efficient transformer-based model for LSTF, named Informer, with three distinctive characteristics: (i) a $ProbSparse$ Self-attention mechanism, which achieves $O(L \log L)$ in time complexity and memory usage, and has comparable performance on sequences' dependency alignment. (ii) the self-attention distilling highlights dominating attention by halving cascading layer input, and efficiently handles extreme long input sequences. (iii) the generative style decoder, while conceptually simple, predicts the long time-series sequences at one forward operation rather than a step-by-step way, which drastically improves the inference speed of long-sequence predictions. Extensive experiments on four large-scale datasets demonstrate that Informer significantly outperforms existing methods and provides a new solution to the LSTF problem.