四虎亚洲精品高清在线观看_爆乳护士一区二区三区在线播放_亚洲成A人在线电影_国产一区二区免费精品无码电影_日产高清一区二区三区2022_天天拍天天爽一区二区三区_理论片一区二区三区在线

As we embark on a new era of LLMs, it becomes increasingly crucial to understand their capabilities, limitations, and differences. Toward making further progress in this direction, we strive to build a deeper understanding of the gaps between massive LLMs (e.g., ChatGPT) and smaller yet effective open-source LLMs and their distilled counterparts. To this end, we specifically focus on long-form question answering (LFQA) because it has several practical and impactful applications (e.g., troubleshooting, customer service, etc.) yet is still understudied and challenging for LLMs. We propose a question-generation method from abstractive summaries and show that generating follow-up questions from summaries of long documents can create a challenging setting for LLMs to reason and infer from long contexts. Our experimental results confirm that: (1) our proposed method of generating questions from abstractive summaries pose a challenging setup for LLMs and shows performance gaps between LLMs like ChatGPT and open-source LLMs (Alpaca, Llama) (2) open-source LLMs exhibit decreased reliance on context for generated questions from the original document, but their generation capabilities drop significantly on generated questions from summaries -- especially for longer contexts (>1024 tokens)

知識薈萃

精品入門和進階教程(cheng)、論文和代碼整(zheng)理等

查看相關VIP內容、論文、資訊等

Performer · Machine Translation · 有向 · 優化器 · MoDELS ·

2023 年 10 月 31 日

On the Pareto Front of Multilingual Neural Machine Translation

Liang Chen,Shuming Ma,Dongdong Zhang,Furu Wei,Baobao Chang

from arxiv, NeurIPS 2023

In this work, we study how the performance of a given direction changes with its sampling ratio in Multilingual Neural Machine Translation (MNMT). By training over 200 multilingual models with various model sizes, data sizes, and language directions, we find it interesting that the performance of certain translation direction does not always improve with the increase of its weight in the multi-task optimization objective. Accordingly, scalarization method leads to a multitask trade-off front that deviates from the traditional Pareto front when there exists data imbalance in the training corpus, which poses a great challenge to improve the overall performance of all directions. Based on our observations, we propose the Double Power Law to predict the unique performance trade-off front in MNMT, which is robust across various languages, data adequacy, and the number of tasks. Finally, we formulate the sample ratio selection problem in MNMT as an optimization problem based on the Double Power Law. In our experiments, it achieves better performance than temperature searching and gradient manipulation methods with only 1/5 to 1/2 of the total training budget. We release the code at //github.com/pkunlp-icler/ParetoMNMT for reproduction.

INFORMS · 多峰值 · Performer · 模態 · 標注 ·

2023 年 10 月 30 日

Multimodal Fusion Interactions: A Study of Human and Automatic Quantification

Paul Pu Liang,Yun Cheng,Ruslan Salakhutdinov,Louis-Philippe Morency

from arxiv, International Conference on Multimodal Interaction (ICMI '23), Code available at: //github.com/pliang279/PID. arXiv admin note: text overlap with arXiv:2302.12247

In order to perform multimodal fusion of heterogeneous signals, we need to understand their interactions: how each modality individually provides information useful for a task and how this information changes in the presence of other modalities. In this paper, we perform a comparative study of how humans annotate two categorizations of multimodal interactions: (1) partial labels, where different annotators annotate the label given the first, second, and both modalities, and (2) counterfactual labels, where the same annotator annotates the label given the first modality before asking them to explicitly reason about how their answer changes when given the second. We further propose an alternative taxonomy based on (3) information decomposition, where annotators annotate the degrees of redundancy: the extent to which modalities individually and together give the same predictions, uniqueness: the extent to which one modality enables a prediction that the other does not, and synergy: the extent to which both modalities enable one to make a prediction that one would not otherwise make using individual modalities. Through experiments and annotations, we highlight several opportunities and limitations of each approach and propose a method to automatically convert annotations of partial and counterfactual labels to information decomposition, yielding an accurate and efficient method for quantifying multimodal interactions.

Performer · Better · Learning · MoDELS · 深度學習 ·

2023 年 10 月 29 日

The Power of Explainability in Forecast-Informed Deep Learning Models for Flood Mitigation

Jimeng Shi,Vitalii Stebliankin,Giri Narasimhan

Floods can cause horrific harm to life and property. However, they can be mitigated or even avoided by the effective use of hydraulic structures such as dams, gates, and pumps. By pre-releasing water via these structures in advance of extreme weather events, water levels are sufficiently lowered to prevent floods. In this work, we propose FIDLAR, a Forecast Informed Deep Learning Architecture, achieving flood management in watersheds with hydraulic structures in an optimal manner by balancing out flood mitigation and unnecessary wastage of water via pre-releases. We perform experiments with FIDLAR using data from the South Florida Water Management District, which manages a coastal area that is highly prone to frequent storms and floods. Results show that FIDLAR performs better than the current state-of-the-art with several orders of magnitude speedup and with provably better pre-release schedules. The dramatic speedups make it possible for FIDLAR to be used for real-time flood management. The main contribution of this paper is the effective use of tools for model explainability, allowing us to understand the contribution of the various environmental factors towards its decisions.

SimPLe · Analysis · 相似度 · LDPC · 圖 ·

2023 年 10 月 29 日

Simple Constructions of Unique Neighbor Expanders from Error-correcting Codes

Swastik Kopparty,Noga Ron-Zewi,Shubhangi Saraf

In this note, we give very simple constructions of unique neighbor expander graphs starting from spectral or combinatorial expander graphs of mild expansion. These constructions and their analysis are simple variants of the constructions of LDPC error-correcting codes from expanders, given by Sipser-Spielman\cite{SS96} (and Tanner\cite{Tanner81}), and their analysis. We also show how to obtain expanders with many unique neighbors using similar ideas. There were many exciting results on this topic recently, starting with Asherov-Dinur\cite{AD23} and Hsieh-McKenzie-Mohanty-Paredes\cite{HMMP23}, who gave a similar construction of unique neighbor expander graphs, but using more sophisticated ingredients (such as almost-Ramanujan graphs) and a more involved analysis. Subsequent beautiful works of Cohen-Roth-TaShma\cite{CRT23} and Golowich\cite{Golowich23} gave even stronger objects (lossless expanders), but also using sophisticated ingredients. The main contribution of this work is that we get much more elementary constructions of unique neighbor expanders and with a simpler analysis.

泛函 · 分離的 · 類別 ·

2023 年 10 月 29 日

Power of Counting by Nonuniform Families of Polynomial-Size Finite Automata

Tomoyuki Yamakami

from arxiv, (A4, 10pt, 21 pages) This paper corrects and extends a preliminary report published in the Proceedings of the 24th International Symposium on Fundamentals of Computation Theory (FCT 2023), Trier, Germany, September 18-24, 2023, Lecture Notes in Computer Science, vol. 14292, pp. 421-435, Springer Cham, 2023

Lately, there have been intensive studies on strengths and limitations of nonuniform families of promise decision problems solvable by various types of polynomial-size finite automata families, where "polynomial-size" refers to the polynomially-bounded state complexity of a finite automata family. In this line of study, we further expand the scope of these studies to families of partial counting and gap functions, defined in terms of nonuniform families of polynomial-size nondeterministic finite automata, and their relevant families of promise decision problems. Counting functions have an ability of counting the number of accepting computation paths produced by nondeterministic finite automata. With no unproven hardness assumption, we show numerous separations and collapses of complexity classes of those partial counting and gap function families and their induced promise decision problem families. We also investigate their relationships to pushdown automata families of polynomial stack-state complexity.

Performer · Pyramid · 可理解性 · 多峰值 · 自編碼器 ·

2023 年 10 月 28 日

SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs

Lijun Yu,Yong Cheng,Zhiruo Wang,Vivek Kumar,Wolfgang Macherey,Yanping Huang,David A. Ross,Irfan Essa,Yonatan Bisk,Ming-Hsuan Yang,Kevin Murphy,Alexander G. Hauptmann,Lu Jiang

from arxiv, NeurIPS 2023 spotlight

In this work, we introduce Semantic Pyramid AutoEncoder (SPAE) for enabling frozen LLMs to perform both understanding and generation tasks involving non-linguistic modalities such as images or videos. SPAE converts between raw pixels and interpretable lexical tokens (or words) extracted from the LLM's vocabulary. The resulting tokens capture both the semantic meaning and the fine-grained details needed for visual reconstruction, effectively translating the visual content into a language comprehensible to the LLM, and empowering it to perform a wide array of multimodal tasks. Our approach is validated through in-context learning experiments with frozen PaLM 2 and GPT 3.5 on a diverse set of image understanding and generation tasks. Our method marks the first successful attempt to enable a frozen LLM to generate image content while surpassing state-of-the-art performance in image understanding tasks, under the same setting, by over 25%.

估計/估計量 · 采樣法 · Networking · 樣本 · Performer ·

2023 年 10 月 28 日

Network Sampling Methods for Estimating Social Networks, Population Percentages, and Totals of People Experiencing Unsheltered Homelessness

Zack W. Almquist,Ashley Hazel,Owen Kajfasz,Janelle Rothfolk,Claire Guilmette,Mary-Catherine Anderson,Larisa Ozeryansky,Amy Hagopian

In this article, we propose using network-based sampling strategies to estimate the number of unsheltered people experiencing homelessness within a given administrative service unit, known as a Continuum of Care. We demonstrate the effectiveness of network sampling methods to solve this problem. Here, we focus on Respondent Driven Sampling (RDS), which has been shown to provide unbiased or low-biased estimates of totals and proportions for hard-to-reach populations in contexts where a sampling frame (e.g., housing addresses) is not available. To make the RDS estimator work for estimating the total number of people living unsheltered, we introduce a new method that leverages administrative data from the HUD-mandated Homeless Management Information System (HMIS). The HMIS provides high-quality counts and demographics for people experiencing homelessness who sleep in emergency shelters. We then demonstrate this method using network data collected in Nashville, TN, combined with simulation methods to illustrate the efficacy of this approach and introduce a method for performing a power analysis to find the optimal sample size in this setting. We conclude with the RDS unsheltered PIT count conducted by King County Regional Homelessness Authority in 2022 (data publicly available on the HUD website) and perform a comparative analysis between the 2022 RDS estimate of unsheltered people experiencing homelessness and an ARIMA forecast of the visual unsheltered PIT count. Finally, we discuss how this method works for estimating the unsheltered population of people experiencing homelessness and future areas of research.

Networking · 可理解性 · Analysis · 穩健性 · 優化器 ·

2023 年 10 月 27 日

A Survey of Network Requirements for Enabling Effective Cyber Deception

Md Abu Sayed,Moqsadur Rahman,Mohammad Ariful Islam Khan,Deepak Tosh

In the evolving landscape of cybersecurity, the utilization of cyber deception has gained prominence as a proactive defense strategy against sophisticated attacks. This paper presents a comprehensive survey that investigates the crucial network requirements essential for the successful implementation of effective cyber deception techniques. With a focus on diverse network architectures and topologies, we delve into the intricate relationship between network characteristics and the deployment of deception mechanisms. This survey provides an in-depth analysis of prevailing cyber deception frameworks, highlighting their strengths and limitations in meeting the requirements for optimal efficacy. By synthesizing insights from both theoretical and practical perspectives, we contribute to a comprehensive understanding of the network prerequisites crucial for enabling robust and adaptable cyber deception strategies.

語言模型化 · MoDELS · Performer · 自動問答 · 論文 ·

2023 年 10 月 26 日

On the Risk of Misinformation Pollution with Large Language Models

Yikang Pan,Liangming Pan,Wenhu Chen,Preslav Nakov,Min-Yen Kan,William Yang Wang

from arxiv, EMNLP 2023 (Findings; Long Paper)

In this paper, we comprehensively investigate the potential misuse of modern Large Language Models (LLMs) for generating credible-sounding misinformation and its subsequent impact on information-intensive applications, particularly Open-Domain Question Answering (ODQA) systems. We establish a threat model and simulate potential misuse scenarios, both unintentional and intentional, to assess the extent to which LLMs can be utilized to produce misinformation. Our study reveals that LLMs can act as effective misinformation generators, leading to a significant degradation in the performance of ODQA systems. To mitigate the harm caused by LLM-generated misinformation, we explore three defense strategies: prompting, misinformation detection, and majority voting. While initial results show promising trends for these defensive strategies, much more work needs to be done to address the challenge of misinformation pollution. Our work highlights the need for further research and interdisciplinary collaboration to address LLM-generated misinformation and to promote responsible use of LLMs.

圖 · Networking · 學成 · Performer · 深度學習 ·

2020 年 10 月 9 日

Temporal Graph Networks for Deep Learning on Dynamic Graphs

Emanuele Rossi,Ben Chamberlain,Fabrizio Frasca,Davide Eynard,Federico Monti,Michael Bronstein

Graph Neural Networks (GNNs) have recently become increasingly popular due to their ability to learn complex systems of relations or interactions arising in a broad spectrum of problems ranging from biology and particle physics to social networks and recommendation systems. Despite the plethora of different models for deep learning on graphs, few approaches have been proposed thus far for dealing with graphs that present some sort of dynamic nature (e.g. evolving features or connectivity over time). In this paper, we present Temporal Graph Networks (TGNs), a generic, efficient framework for deep learning on dynamic graphs represented as sequences of timed events. Thanks to a novel combination of memory modules and graph-based operators, TGNs are able to significantly outperform previous approaches being at the same time more computationally efficient. We furthermore show that several previous models for learning on dynamic graphs can be cast as specific instances of our framework. We perform a detailed ablation study of different components of our framework and devise the best configuration that achieves state-of-the-art performance on several transductive and inductive prediction tasks for dynamic graphs.