国产一区二区高清无码_日韩一区国产二区不卡_亚洲日韩精品欧美1区2区3区在线观看_日韩精品中文字幕一区二区中文_国产一区二区三区精品免费视频_国产在线一区二区三区视频_美女极度色诱视频国产舒心

Over the last fifty years, the United States have experienced hundreds of mass public shootings that resulted in thousands of victims. Characterized by their frequent occurrence and devastating nature, mass shootings have become a major public health hazard that dramatically impact safety and well-being of individuals and communities. Given the epidemic traits of this phenomenon, there have been concerted efforts to understand the root causes that lead to public mass shootings in order to implement effective prevention strategies. We propose a quantile mixed graphical model for investigating the intricacies of inter- and infra-domain relationships of this complex phenomenon, where conditional relations between discrete and continuous variables are modeled without stringent distributional assumptions using Parzen's definition of mid-quantile. To retrieve the graph structure and recover only the most relevant connections, we consider the neighborhood selection approach in which conditional mid-quantiles of each variable in the network are modeled as a sparse function of all others. We propose a two-step procedure to estimate the graph where, in the first step, conditional mid-probabilities are obtained semi-parametrically and, in the second step, the model parameters are estimated by solving an implicit equation with a LASSO penalty.

相關內容

MASS

關注 0

MASS：IEEE International Conference on Mobile Ad-hoc and Sensor Systems。 Explanation：移動Ad hoc和傳感器系統IEEE國際會議。 Publisher：IEEE。 SIT：

MoDELS · GPT-4V · HTTPS · 語言模型化 · Analysis ·

2023 年 11 月 13 日

An LLM-free Multi-dimensional Benchmark for MLLMs Hallucination Evaluation

Junyang Wang,Yuhang Wang,Guohai Xu,Jing Zhang,Yukai Gu,Haitao Jia,Ming Yan,Ji Zhang,Jitao Sang

from arxiv, 11 pages, 4 figures

Despite making significant progress in multi-modal tasks, current Multi-modal Large Language Models (MLLMs) encounter the significant challenge of hallucination, which may lead to harmful consequences. Therefore, evaluating MLLMs' hallucinations is becoming increasingly important in model improvement and practical application deployment. Previous works are limited in high evaluation costs (e.g., relying on humans or advanced LLMs) and insufficient evaluation dimensions (e.g., types of hallucination and task). In this paper, we propose an LLM-free multi-dimensional benchmark AMBER, which can be used to evaluate both generative task and discriminative task including object existence, object attribute and object relation hallucination. Based on AMBER, we design a low-cost and efficient evaluation pipeline. Additionally, we conduct a comprehensive evaluation and detailed analysis of mainstream MLLMs including GPT-4V(ision), and also give guideline suggestions for mitigating hallucinations. The data and code of AMBER are available at //github.com/junyangwang0410/AMBER.

情景 · 基準 · MoDELS · 規范化的 · 數據集 ·

2023 年 11 月 12 日

Setting a Baseline for long-shot real-time Player and Ball detection in Soccer Videos

Konstantinos Moutselos,Ilias Maglogiannis

from arxiv, 6 pages, 4 figures, 1 table. 14th International Conference on Information,Intelligence, Systems and Applications (IISA 2023) , Thessaly, Volos, Greece, 10-12 July 2023

Players and ball detection are among the first required steps on a football analytics platform. Until recently, the existing open datasets on which the evaluations of most models were based, were not sufficient. In this work, we point out their weaknesses, and with the advent of the SoccerNet v3, we propose and deliver to the community an edited part of its dataset, in YOLO normalized annotation format for training and evaluation. The code of the methods and metrics are provided so that they can be used as a benchmark in future comparisons. The recent YOLO8n model proves better than FootAndBall in long-shot real-time detection of the ball and players on football fields.

變換 · Graph Transformer · Analysis · 圖 · MoDELS ·

2023 年 11 月 10 日

A higher-order transformation approach to the formalization and analysis of BPMN using graph transformation systems

Tim Kr?uter,Adrian Rutle,Harald K?nig,Yngve Lamo

The Business Process Modeling Notation (BPMN) is a widely used standard notation for defining intra- and inter-organizational workflows. However, the informal description of the BPMN execution semantics leads to different interpretations of BPMN elements and difficulties in checking behavioral properties. In this article, we propose a formalization of the execution semantics of BPMN that, compared to existing approaches, covers more BPMN elements while also facilitating property checking. Our approach is based on a higher-order transformation from BPMN models to graph transformation systems. To show the capabilities of our approach, we implemented it as an open-source web-based tool.

Integration · 符號學 ·

2023 年 11 月 9 日

Reduction-based Creative Telescoping for P-recursive Sequences via Integral Bases

Shaoshi Chen,Lixin Du,Manuel Kauers,Rong-Hua Wang

from arxiv, 20 pages

We propose a way to split a given bivariate P-recursive sequence into a summable part and a non-summable part in such a way that the non-summable part is minimal in some sense. This decomposition gives rise to a new reduction-based creative telescoping algorithm based on the concept of integral bases.

估計/估計量 · 時間步 · 優化器 · MoDELS · CASES ·

2023 年 11 月 9 日

Global error estimates of high-order fully decoupled schemes for the Cahn-Hilliard-Navier-Stokes model of Two-Phase Incompressible Flows

Xiaoli Li,Nan Zheng,Jie Shen,Zhengguang Liu

from arxiv, arXiv admin note: text overlap with arXiv:2009.09353

In this paper we construct new fully decoupled and high-order implicit-explicit (IMEX) schemes for the two-phase incompressible flows based on the new generalized scalar auxiliary variable approach with optimal energy approximation (EOP-GSAV) for Cahn-Hilliard equation and consistent splitting method for Navier-Stokes equation. These schemes are linear, fully decoupled, unconditionally energy stable, only require solving a sequence of elliptic equations with constant coefficients at each time step, and provide a new technique to preserve the consistency between original energy and modified energy. We derive that numerical solutions of these schemes are uniformly bounded without any restriction on time step size. Furthermore, we carry out a rigorous error analysis for the first-order scheme and establish optimal global error estimates for the phase function, velocity and pressure in two and three-dimensional cases. Numerical examples are presented to validate the proposed schemes.

剪枝 · Better · CAP · contrastive · MoDELS ·

2021 年 12 月 14 日

From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression

Runxin Xu,Fuli Luo,Chengyu Wang,Baobao Chang,Jun Huang,Songfang Huang,Fei Huang

from arxiv, Accepted to AAAI 2022

Pre-trained Language Models (PLMs) have achieved great success in various Natural Language Processing (NLP) tasks under the pre-training and fine-tuning paradigm. With large quantities of parameters, PLMs are computation-intensive and resource-hungry. Hence, model pruning has been introduced to compress large-scale PLMs. However, most prior approaches only consider task-specific knowledge towards downstream tasks, but ignore the essential task-agnostic knowledge during pruning, which may cause catastrophic forgetting problem and lead to poor generalization ability. To maintain both task-agnostic and task-specific knowledge in our pruned model, we propose ContrAstive Pruning (CAP) under the paradigm of pre-training and fine-tuning. It is designed as a general framework, compatible with both structured and unstructured pruning. Unified in contrastive learning, CAP enables the pruned model to learn from the pre-trained model for task-agnostic knowledge, and fine-tuned model for task-specific knowledge. Besides, to better retain the performance of the pruned model, the snapshots (i.e., the intermediate models at each pruning iteration) also serve as effective supervisions for pruning. Our extensive experiments show that adopting CAP consistently yields significant improvements, especially in extremely high sparsity scenarios. With only 3% model parameters reserved (i.e., 97% sparsity), CAP successfully achieves 99.2% and 96.3% of the original BERT performance in QQP and MNLI tasks. In addition, our probing experiments demonstrate that the model pruned by CAP tends to achieve better generalization ability.

語言模型化 · MoDELS · IR · 似然 · 掩碼語言模型化 ·

2020 年 10 月 20 日

PROP: Pre-training with Representative Words Prediction for Ad-hoc Retrieval

Xinyu Ma,Jiafeng Guo,Ruqing Zhang,Yixing Fan,Xiang Ji,Xueqi Cheng

from arxiv, Accepted by WSDM2021

Recently pre-trained language representation models such as BERT have shown great success when fine-tuned on downstream tasks including information retrieval (IR). However, pre-training objectives tailored for ad-hoc retrieval have not been well explored. In this paper, we propose Pre-training with Representative wOrds Prediction (PROP) for ad-hoc retrieval. PROP is inspired by the classical statistical language model for IR, specifically the query likelihood model, which assumes that the query is generated as the piece of text representative of the "ideal" document. Based on this idea, we construct the representative words prediction (ROP) task for pre-training. Given an input document, we sample a pair of word sets according to the document language model, where the set with higher likelihood is deemed as more representative of the document. We then pre-train the Transformer model to predict the pairwise preference between the two word sets, jointly with the Masked Language Model (MLM) objective. By further fine-tuning on a variety of representative downstream ad-hoc retrieval tasks, PROP achieves significant improvements over baselines without pre-training or with other pre-training methods. We also show that PROP can achieve exciting performance under both the zero- and low-resource IR settings. The code and pre-trained models are available at //github.com/Albert-Ma/PROP.

Pegasus · Performer · state-of-the-art · MoDELS · ROUGE ·

2020 年 6 月 2 日

PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization

Jingqing Zhang,Yao Zhao,Mohammad Saleh,Peter J. Liu

from arxiv, Added Human Evaluation results; Code link added; Accepted for ICML 2020

Recent work pre-training Transformers with self-supervised objectives on large text corpora has shown great success when fine-tuned on downstream NLP tasks including text summarization. However, pre-training objectives tailored for abstractive text summarization have not been explored. Furthermore there is a lack of systematic evaluation across diverse domains. In this work, we propose pre-training large Transformer-based encoder-decoder models on massive text corpora with a new self-supervised objective. In PEGASUS, important sentences are removed/masked from an input document and are generated together as one output sequence from the remaining sentences, similar to an extractive summary. We evaluated our best PEGASUS model on 12 downstream summarization tasks spanning news, science, stories, instructions, emails, patents, and legislative bills. Experiments demonstrate it achieves state-of-the-art performance on all 12 downstream datasets measured by ROUGE scores. Our model also shows surprising performance on low-resource summarization, surpassing previous state-of-the-art results on 6 datasets with only 1000 examples. Finally we validated our results using human evaluation and show that our model summaries achieve human performance on multiple datasets.

MoDELS · 模型評估 · NLP · Extensibility · 可辨認的 ·

2020 年 5 月 8 日

Beyond Accuracy: Behavioral Testing of NLP models with CheckList

Marco Tulio Ribeiro,Tongshuang Wu,Carlos Guestrin,Sameer Singh

Although measuring held-out accuracy has been the primary approach to evaluate generalization, it often overestimates the performance of NLP models, while alternative approaches for evaluating models either focus on individual tasks or on specific behaviors. Inspired by principles of behavioral testing in software engineering, we introduce CheckList, a task-agnostic methodology for testing NLP models. CheckList includes a matrix of general linguistic capabilities and test types that facilitate comprehensive test ideation, as well as a software tool to generate a large and diverse number of test cases quickly. We illustrate the utility of CheckList with tests for three tasks, identifying critical failures in both commercial and state-of-art models. In a user study, a team responsible for a commercial sentiment analysis model found new and actionable bugs in an extensively tested model. In another user study, NLP practitioners with CheckList created twice as many tests, and found almost three times as many bugs as users without it.

LayoutLM · INFORMS · 可理解性 · SCAN · MoDELS ·

2020 年 2 月 19 日

LayoutLM: Pre-training of Text and Layout for Document Image Understanding

Yiheng Xu,Minghao Li,Lei Cui,Shaohan Huang,Furu Wei,Ming Zhou

from arxiv, Work in progress

Pre-training techniques have been verified successfully in a variety of NLP tasks in recent years. Despite the widespread of pre-training models for NLP applications, they almost focused on text-level manipulation, while neglecting the layout and style information that is vital for document image understanding. In this paper, we propose the LayoutLM to jointly model the interaction between text and layout information across scanned document images, which is beneficial for a great number of real-world document image understanding tasks such as information extraction from scanned documents. Furthermore, we also leverage the image features to incorporate the visual information of words into LayoutLM. To the best of our knowledge, this is the first time that text and layout are jointly learned in a single framework for document-level pre-training. It achieves new state-of-the-art results in several downstream tasks, including form understanding (from 70.72 to 79.27), receipt understanding (from 94.02 to 95.24) and document image classification (from 93.07 to 94.42). The code and pre-trained LayoutLM models are publicly available at //github.com/microsoft/unilm/tree/master/layoutlm.