
Open-source Large Language Models enable projects such as NASA SciX (i.e., NASA ADS) to think outside the box and try alternative approaches to information retrieval and data augmentation while respecting data copyright and users' privacy. However, when large language models are prompted with questions directly, without any context, they are prone to hallucination. At NASA SciX we developed an experiment in which we created semantic vectors for our large collection of abstracts and full-text content, and we designed a prompt system that asks questions using contextual chunks retrieved from our system. Based on a non-systematic human evaluation, the experiment shows a lower degree of hallucination and better responses when using Retrieval Augmented Generation. Further exploration is required to design new features and data augmentation processes at NASA SciX that leverage this technology while respecting the high level of trust and quality that the project holds.
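
A minimal sketch of the retrieval-augmented prompting flow described above, assuming TF-IDF similarity in place of the project's semantic vectors; the toy chunks, prompt wording, and top-k value are illustrative assumptions, and the composed prompt would then be sent to an open-source LLM rather than the model being asked the question directly.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

# Toy corpus standing in for abstract/full-text chunks from the collection.
chunks = [
    "Gravitational waves from binary neutron star mergers were detected by LIGO.",
    "Exoplanet transit photometry with the Kepler mission revealed many candidates.",
    "Dark matter halo profiles are studied in cosmological N-body simulations.",
]

vectorizer = TfidfVectorizer()
chunk_vectors = vectorizer.fit_transform(chunks)

def build_prompt(question, k=2):
    """Retrieve the k most similar chunks and prepend them as context."""
    q_vec = vectorizer.transform([question])
    sims = cosine_similarity(q_vec, chunk_vectors).ravel()
    top = sims.argsort()[::-1][:k]
    context = "\n".join(chunks[i] for i in top)
    return (
        "Answer the question using only the context below.\n"
        f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"
    )

print(build_prompt("How are binary neutron star mergers detected?"))
```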

Related Content

Large language models are deep learning models trained on massive amounts of text data. They can not only generate natural-language text but also understand its meaning in depth, handling a wide range of natural language tasks such as text summarization, question answering, and translation. In 2023, large language models and their applications in artificial intelligence became a global research focus, and their growth in scale has been especially striking, with parameter counts jumping from the initial billions to as many as a trillion today. This increase in parameters allows models to capture the subtleties of human language more precisely and to understand its complexity more deeply. Over the past year, large language models have improved markedly at absorbing new knowledge, decomposing complex tasks, and aligning text with images. As the technology matures, it will continue to broaden its range of applications, providing more intelligent and personalized services and further improving the way people live and work.

Our study presents a new framework that incorporates the Analytic Hierarchy Process (AHP) and the Generative Pre-trained Transformer 4 (GPT-4) large language model (LLM), bringing novel approaches to cybersecurity Multiple-Criteria Decision Analysis (MCDA). By utilizing the capabilities of GPT-4 autonomous agents as virtual experts, we automate the decision-making process, enhancing both efficiency and reliability. This approach focuses on leveraging LLMs for sophisticated decision analysis, highlighting the synergy between traditional decision-making models and cutting-edge AI technologies. Our methodology demonstrates significant advancements in using AI-driven agents for complex decision-making scenarios, underscoring the importance of AI in strategic cybersecurity applications. The findings reveal the transformative potential of combining AHP and LLMs, establishing a new paradigm for intelligent decision support systems in cybersecurity and beyond.
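
To ground the AHP side of the framework, here is a minimal sketch that derives criterion weights from a pairwise comparison matrix via the principal eigenvector and checks Saaty's consistency ratio. The criteria and comparison values are illustrative assumptions standing in for judgments that, in the framework above, would be elicited from GPT-4 agents acting as virtual experts.

```python
import numpy as np

# Pairwise comparison matrix over three example cybersecurity criteria
# (values are illustrative, not taken from the paper or from GPT-4 output).
criteria = ["impact", "likelihood", "mitigation cost"]
A = np.array([
    [1,     3,   5],
    [1 / 3, 1,   2],
    [1 / 5, 1 / 2, 1],
])

# AHP priority vector: normalized principal eigenvector of the comparison matrix.
eigvals, eigvecs = np.linalg.eig(A)
principal = eigvecs[:, eigvals.real.argmax()].real
weights = principal / principal.sum()

# Consistency check (RI = 0.58 is Saaty's random index for n = 3).
lambda_max = eigvals.real.max()
ci = (lambda_max - len(A)) / (len(A) - 1)
cr = ci / 0.58

for name, w in zip(criteria, weights):
    print(f"{name}: {w:.3f}")
print(f"consistency ratio: {cr:.3f}")
```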

The rapid development of Multi-modality Large Language Models (MLLMs) has driven a paradigm shift in computer vision, moving towards versatile foundational models. However, evaluating MLLMs on low-level visual perception and understanding remains a largely unexplored domain. To this end, we design benchmark settings to emulate human language responses related to low-level vision: low-level visual perception (A1), via visual question answering related to low-level attributes (e.g., clarity, lighting); and low-level visual description (A2), which evaluates MLLMs on low-level text descriptions. Furthermore, given that pairwise comparison can better avoid ambiguity in responses and has been adopted in many human experiments, we further extend the low-level perception-related question-answering and description evaluations of MLLMs from single images to image pairs. Specifically, for perception (A1), we construct the LLVisionQA+ dataset, comprising 2,990 single images and 1,999 image pairs, each accompanied by an open-ended question about its low-level features; for description (A2), we propose the LLDescribe+ dataset, evaluating MLLMs on low-level descriptions of 499 single images and 450 pairs. Additionally, we evaluate MLLMs on assessment (A3) ability, i.e., score prediction, by employing a softmax-based approach that enables all MLLMs to generate quantifiable quality ratings, tested against human opinions on 7 image quality assessment (IQA) datasets. With 24 MLLMs under evaluation, we demonstrate that several MLLMs have decent low-level visual competencies on single images, but only GPT-4V exhibits higher accuracy on pairwise comparisons than on single-image evaluations (as humans do). We hope that our benchmark will motivate further research into uncovering and enhancing these nascent capabilities of MLLMs. Datasets will be available at //github.com/Q-Future/Q-Bench.
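
The softmax-based scoring mentioned for the assessment task (A3) can be sketched as follows, assuming the MLLM exposes the logits it assigns to a pair of antonym tokens such as "good" and "poor" after a quality-rating prompt; the token choice and logit values here are illustrative assumptions, not outputs from any evaluated model.

```python
import math

def quality_score(logit_good, logit_poor):
    """Softmax over the logits an MLLM assigns to the tokens 'good' and 'poor'
    after a prompt such as 'Rate the quality of this image.' The probability of
    'good' serves as a quantifiable quality rating that can be correlated with
    human opinion scores."""
    return math.exp(logit_good) / (math.exp(logit_good) + math.exp(logit_poor))

# Example logits (made-up values for illustration).
print(quality_score(logit_good=2.1, logit_poor=0.4))  # ~0.85
```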

In this paper, we build on the 1971 memo "Twenty Things to Do With a Computer" by Seymour Papert and Cynthia Solomon and propose twenty constructionist things to do with artificial intelligence and machine learning. Several proposals build on ideas developed in the original memo, while others are new and address topics in science, mathematics, and the arts. In reviewing the big themes, we notice a renewed interest in children's engagement, not just to build technical proficiency but also to cultivate a deeper understanding of their own cognitive processes. Furthermore, the ideas stress the importance of designing personally relevant AI/ML applications, moving beyond isolated models and off-the-shelf datasets disconnected from children's interests. We also acknowledge the social aspects of data production involved in making AI/ML applications. Finally, we highlight the critical dimensions necessary to address potentially harmful algorithmic biases and consequences of AI/ML applications.

We introduce FaBERT, a Persian BERT-base model pre-trained on the HmBlogs corpus, encompassing both informal and formal Persian texts. FaBERT is designed to excel in traditional Natural Language Understanding (NLU) tasks, addressing the intricacies of diverse sentence structures and linguistic styles prevalent in the Persian language. In our comprehensive evaluation of FaBERT on 12 datasets in various downstream tasks, encompassing Sentiment Analysis (SA), Named Entity Recognition (NER), Natural Language Inference (NLI), Question Answering (QA), and Question Paraphrasing (QP), it consistently demonstrated improved performance, all achieved within a compact model size. The findings highlight the importance of utilizing diverse and cleaned corpora, such as HmBlogs, to enhance the performance of language models like BERT in Persian Natural Language Processing (NLP) applications. FaBERT is openly accessible at //huggingface.co/sbunlp/fabert
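
Assuming the model id implied by the link above and the standard Hugging Face transformers interface, a minimal usage sketch looks like this; the example sentence and the choice of the [CLS] vector as a sentence representation are illustrative.

```python
from transformers import AutoModel, AutoTokenizer

# Load FaBERT from the Hugging Face Hub (model id taken from the link above).
tokenizer = AutoTokenizer.from_pretrained("sbunlp/fabert")
model = AutoModel.from_pretrained("sbunlp/fabert")

# Encode a short Persian sentence and take the [CLS] vector as its representation.
inputs = tokenizer("این یک جمله آزمایشی است.", return_tensors="pt")
outputs = model(**inputs)
sentence_vector = outputs.last_hidden_state[:, 0, :]
print(sentence_vector.shape)  # (1, hidden_size)
```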

Modern SAT and SMT solvers are designed to handle problems expressed in Conjunctive Normal Form (CNF) so that non-CNF problems must be CNF-ized upfront, typically by using variants of either Tseitin or Plaisted and Greenbaum transformations. When passing from solving to enumeration, however, the capability of producing partial satisfying assignments that are as small as possible becomes crucial, which raises the question of whether such CNF encodings are also effective for enumeration. In this paper, we investigate both theoretically and empirically the effectiveness of CNF conversions for SAT and SMT enumeration. On the negative side, we show that: (i) Tseitin transformation prevents the solver from producing short partial assignments, thus seriously affecting the effectiveness of enumeration; (ii) Plaisted and Greenbaum transformation overcomes this problem only in part. On the positive side, we prove theoretically and we show empirically that combining Plaisted and Greenbaum transformation with NNF preprocessing upfront -- which is typically not used in solving -- can fully overcome the problem and can drastically reduce both the number of partial assignments and the execution time.
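
As a concrete illustration (not taken from the paper), here is a sketch of the two encodings for F = (a AND b) OR (c AND d) in DIMACS-style integer literals; the variable numbering and the simplification of not labeling the top-level disjunction are assumptions. The extra "subformula implies label" clauses that Tseitin adds fix the auxiliary variables in every model, which is one way to see why it tends to block the short partial assignments that matter for enumeration, whereas Plaisted-Greenbaum drops the polarity-irrelevant direction.

```python
# CNF-izing F = (a AND b) OR (c AND d):
# a=1, b=2, c=3, d=4, auxiliary labels x1=5, x2=6 for the two conjunctions.

# Tseitin: each label is *equivalent* to its subformula (both implications kept).
tseitin = [
    [-5, 1], [-5, 2], [-1, -2, 5],   # x1 <-> (a AND b)
    [-6, 3], [-6, 4], [-3, -4, 6],   # x2 <-> (c AND d)
    [5, 6],                          # root: x1 OR x2
]

# Plaisted-Greenbaum: only the polarity-relevant implication is kept, so the
# clauses forcing a label to true when its subformula holds are dropped.
plaisted_greenbaum = [
    [-5, 1], [-5, 2],                # x1 -> (a AND b)
    [-6, 3], [-6, 4],                # x2 -> (c AND d)
    [5, 6],                          # root: x1 OR x2
]

print(len(tseitin), "Tseitin clauses vs", len(plaisted_greenbaum), "PG clauses")
```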

The inherent diversity of computation types within individual Deep Neural Network (DNN) models imposes a corresponding need for a varied set of computation units within hardware processors. This diversity poses a significant constraint on computation efficiency during the execution of different neural networks. In this study, we present NeuralMatrix, a framework that transforms the computation of entire DNNs into linear matrix operations. This transformation seamlessly enables the execution of various DNN models using a single General-Purpose Matrix Multiplication (GEMM) accelerator. Extensive experimental results spanning different DNN models demonstrate that our approach preserves network accuracy while providing both generality and application-specific levels of computation efficiency. This allows a broad spectrum of DNN models to be executed using a single GEMM accelerator, eliminating the need for additional special function units.
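
The paper's specific lowering of non-linear operations is not described here; purely as a hedged illustration of the general idea of expressing DNN computation as matrix multiplication for a GEMM accelerator, the sketch below maps a small 2-D convolution to a single GEMM via the standard im2col transformation (input size, kernel size, and stride are arbitrary choices, and this is not NeuralMatrix itself).

```python
import numpy as np

def im2col(x, k):
    """Unfold k x k patches of a 2-D array into columns (stride 1, no padding)."""
    H, W = x.shape
    out_h, out_w = H - k + 1, W - k + 1
    cols = np.empty((k * k, out_h * out_w))
    idx = 0
    for i in range(out_h):
        for j in range(out_w):
            cols[:, idx] = x[i:i + k, j:j + k].ravel()
            idx += 1
    return cols, (out_h, out_w)

x = np.random.rand(5, 5)   # toy feature map
w = np.random.rand(3, 3)   # toy convolution kernel

# Convolution expressed as a single matrix multiplication (GEMM).
cols, out_shape = im2col(x, 3)
y_gemm = (w.ravel() @ cols).reshape(out_shape)

# Reference: direct sliding-window computation.
y_ref = np.array([[np.sum(x[i:i + 3, j:j + 3] * w) for j in range(x.shape[1] - 2)]
                  for i in range(x.shape[0] - 2)])
assert np.allclose(y_gemm, y_ref)
```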

Pre-trained Language Models (PLMs) have achieved great success in various Natural Language Processing (NLP) tasks under the pre-training and fine-tuning paradigm. With large quantities of parameters, PLMs are computation-intensive and resource-hungry. Hence, model pruning has been introduced to compress large-scale PLMs. However, most prior approaches only consider task-specific knowledge for downstream tasks but ignore the essential task-agnostic knowledge during pruning, which may cause the catastrophic forgetting problem and lead to poor generalization ability. To maintain both task-agnostic and task-specific knowledge in the pruned model, we propose ContrAstive Pruning (CAP) under the paradigm of pre-training and fine-tuning. It is designed as a general framework, compatible with both structured and unstructured pruning. Unified under contrastive learning, CAP enables the pruned model to learn from the pre-trained model for task-agnostic knowledge and from the fine-tuned model for task-specific knowledge. Besides, to better retain the performance of the pruned model, the snapshots (i.e., the intermediate models at each pruning iteration) also serve as effective supervision signals for pruning. Our extensive experiments show that adopting CAP consistently yields significant improvements, especially in extremely high sparsity scenarios. With only 3% of model parameters retained (i.e., 97% sparsity), CAP successfully achieves 99.2% and 96.3% of the original BERT performance on the QQP and MNLI tasks. In addition, our probing experiments demonstrate that the model pruned by CAP tends to achieve better generalization ability.
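
A minimal sketch, not the paper's exact objective: an InfoNCE-style loss that pulls the pruned model's sentence representations toward those of the pre-trained and fine-tuned models, with other in-batch sentences as negatives. The dimensions, temperature, and use of random tensors in place of real encoder outputs are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def contrastive_loss(anchor, positives, temperature=0.1):
    """InfoNCE-style loss: pull `anchor` (pruned-model representations) toward
    `positives` (pre-trained or fine-tuned model representations of the same
    sentences); the other sentences in the batch act as negatives."""
    a = F.normalize(anchor, dim=-1)        # (B, D)
    p = F.normalize(positives, dim=-1)     # (B, D)
    logits = a @ p.t() / temperature       # (B, B) similarity matrix
    labels = torch.arange(a.size(0))       # the matching row is the positive
    return F.cross_entropy(logits, labels)

B, D = 8, 768
pruned = torch.randn(B, D)       # stand-in for pruned-model outputs
pretrained = torch.randn(B, D)   # stand-in for pre-trained model outputs
finetuned = torch.randn(B, D)    # stand-in for fine-tuned model outputs

# Sketch of the combined objective; a task loss would be added in practice.
loss = contrastive_loss(pruned, pretrained) + contrastive_loss(pruned, finetuned)
print(loss.item())
```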

Recent work pre-training Transformers with self-supervised objectives on large text corpora has shown great success when fine-tuned on downstream NLP tasks, including text summarization. However, pre-training objectives tailored for abstractive text summarization have not been explored. Furthermore, there is a lack of systematic evaluation across diverse domains. In this work, we propose pre-training large Transformer-based encoder-decoder models on massive text corpora with a new self-supervised objective. In PEGASUS, important sentences are removed/masked from an input document and are generated together as one output sequence from the remaining sentences, similar to an extractive summary. We evaluated our best PEGASUS model on 12 downstream summarization tasks spanning news, science, stories, instructions, emails, patents, and legislative bills. Experiments demonstrate that it achieves state-of-the-art performance on all 12 downstream datasets as measured by ROUGE scores. Our model also shows surprising performance on low-resource summarization, surpassing previous state-of-the-art results on 6 datasets with only 1000 examples. Finally, we validated our results using human evaluation and show that our model summaries achieve human performance on multiple datasets.
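
To make the gap-sentence objective concrete, here is a toy sketch (assumptions: a crude word-overlap proxy instead of ROUGE, a fixed mask ratio, and a generic "<mask_1>" token) that selects the most "important" sentences, masks them in the source, and concatenates them as the generation target.

```python
import re

def gap_sentence_generation(text, mask_ratio=0.3):
    """Toy PEGASUS-style GSG: score each sentence by word overlap with the rest
    of the document, mask the top-scoring ones in the input, and concatenate
    them as the target sequence the model must generate."""
    sents = re.split(r"(?<=[.!?])\s+", text.strip())

    def score(i):
        s = set(sents[i].lower().split())
        rest = set(" ".join(sents[:i] + sents[i + 1:]).lower().split())
        return len(s & rest) / max(len(s), 1)

    n_mask = max(1, int(len(sents) * mask_ratio))
    masked = set(sorted(range(len(sents)), key=score, reverse=True)[:n_mask])
    source = " ".join("<mask_1>" if i in masked else s for i, s in enumerate(sents))
    target = " ".join(sents[i] for i in sorted(masked))
    return source, target

doc = ("PEGASUS pre-trains an encoder-decoder on gap-sentence generation. "
       "Important sentences are masked in the input document. "
       "The model must regenerate those sentences as a pseudo-summary.")
src, tgt = gap_sentence_generation(doc)
print(src)
print(tgt)
```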

Automatic KB completion for commonsense knowledge graphs (e.g., ATOMIC and ConceptNet) poses unique challenges compared to the much-studied conventional knowledge bases (e.g., Freebase). Commonsense knowledge graphs use free-form text to represent nodes, resulting in orders of magnitude more nodes than conventional KBs (18x more nodes in ATOMIC than in Freebase (FB15K-237)). Importantly, this implies significantly sparser graph structures - a major challenge for existing KB completion methods that assume densely connected graphs over a relatively smaller set of nodes. In this paper, we present novel KB completion models that address these challenges by exploiting the structural and semantic context of nodes. Specifically, we investigate two key ideas: (1) learning from local graph structure, using graph convolutional networks and automatic graph densification, and (2) transfer learning from pre-trained language models to knowledge graphs for enhanced contextual representation of knowledge. We describe our method for incorporating information from both these sources in a joint model and provide the first empirical results for KB completion on ATOMIC and evaluation with ranking metrics on ConceptNet. Our results demonstrate the effectiveness of language model representations in boosting link prediction performance and the advantages of learning from local graph structure (+1.5 points in MRR for ConceptNet) when training on subgraphs for computational efficiency. Further analysis of model predictions sheds light on the types of commonsense knowledge that language models capture well.
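
A heavily simplified sketch of the joint idea of combining semantic (text-derived) and structural (graph-derived) node context for link-prediction scoring: the hashed bag-of-words encoder stands in for a pre-trained language model, the one-layer graph convolution for the GCN component, and the toy graph, dimensions, and scoring function are all illustrative assumptions, not the paper's model.

```python
import numpy as np

# Toy commonsense graph: free-form text nodes and an adjacency matrix.
nodes = ["person drinks coffee", "person feels awake", "person is tired"]
A = np.array([[0, 1, 0],
              [1, 0, 0],
              [0, 0, 0]], dtype=float)

def text_embed(s, dim=16):
    """Stand-in for a pre-trained language-model encoder (hashed bag of words)."""
    v = np.zeros(dim)
    for w in s.split():
        v[hash(w) % dim] += 1.0
    return v / (np.linalg.norm(v) + 1e-9)

X = np.stack([text_embed(n) for n in nodes])   # semantic context per node
A_hat = A + np.eye(len(nodes))                 # add self-loops
D_inv = np.diag(1.0 / A_hat.sum(1))            # row-normalization
W = np.random.randn(X.shape[1], X.shape[1]) * 0.1
H = np.tanh(D_inv @ A_hat @ X @ W)             # one GCN layer: structural context

def score(i, j):
    """Score a candidate edge by combining graph- and text-based representations."""
    return float(np.concatenate([H[i], X[i]]) @ np.concatenate([H[j], X[j]]))

print(score(0, 1), score(0, 2))
```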

Pre-trained deep neural network language models such as ELMo, GPT, BERT and XLNet have recently achieved state-of-the-art performance on a variety of language understanding tasks. However, their size makes them impractical for a number of scenarios, especially on mobile and edge devices. In particular, the input word embedding matrix accounts for a significant proportion of the model's memory footprint, due to the large input vocabulary and embedding dimensions. Knowledge distillation techniques have had success at compressing large neural network models, but they are ineffective at yielding student models with vocabularies different from the original teacher models. We introduce a novel knowledge distillation technique for training a student model with a significantly smaller vocabulary as well as lower embedding and hidden state dimensions. Specifically, we employ a dual-training mechanism that trains the teacher and student models simultaneously to obtain optimal word embeddings for the student vocabulary. We combine this approach with learning shared projection matrices that transfer layer-wise knowledge from the teacher model to the student model. Our method is able to compress the BERT_BASE model by more than 60x, with only a minor drop in downstream task metrics, resulting in a language model with a footprint of under 7MB. Experimental results also demonstrate higher compression efficiency and accuracy when compared with other state-of-the-art compression techniques.
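
As a hedged sketch of the layer-wise transfer idea (not the paper's full dual-training recipe), the snippet below uses a shared linear projection to map teacher hidden states down to the student's smaller hidden size and penalizes the mismatch; the dimensions, the MSE objective, and the random tensors standing in for real layer outputs are assumptions.

```python
import torch
import torch.nn as nn

teacher_dim, student_dim = 768, 256

# Shared down-projection aligning teacher hidden states with the smaller
# student hidden size, reused across layers in this sketch.
proj = nn.Linear(teacher_dim, student_dim, bias=False)

def layer_distill_loss(teacher_hidden, student_hidden):
    """MSE between projected teacher states and student states for one layer."""
    return nn.functional.mse_loss(proj(teacher_hidden), student_hidden)

t_h = torch.randn(4, 128, teacher_dim)   # (batch, seq, dim) teacher layer output
s_h = torch.randn(4, 128, student_dim)   # corresponding student layer output
loss = layer_distill_loss(t_h, s_h)
print(loss.item())
```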
