午夜剧场成年免费视,碰碰女人公开免费视频,初女破初的视频免费看,亚洲欧美中文日韩在线观看,欧美日韩国产一区二区三区在线

We present LEMMING, a modular log-linear model that jointly models lemmatization and tagging and supports the integration of arbitrary global features. It is trainable on corpora annotated with gold standard tags and lemmata and does not rely on morphological dictionaries or analyzers. LEMMING sets the new state of the art in token-based statistical lemmatization on six languages; e.g., for Czech lemmatization, we reduce the error by 60%, from 4.05 to 1.58. We also give empirical evidence that jointly modeling morphological tags and lemmata is mutually beneficial.

相關內容

MoDELS

關注 43

ACM/IEEE第23屆模型驅動工程語言和系統國際會議，是模型驅動軟件和系統工程的首要會議系列，由ACM-SIGSOFT和IEEE-TCSE支持組織。自1998年以來，模型涵蓋了建模的各個方面，從語言和方法到工具和應用程序。模特的參加者來自不同的背景，包括研究人員、學者、工程師和工業專業人士。MODELS 2019是一個論壇，參與者可以圍繞建模和模型驅動的軟件和系統交流前沿研究成果和創新實踐經驗。今年的版本將為建模社區提供進一步推進建模基礎的機會，并在網絡物理系統、嵌入式系統、社會技術系統、云計算、大數據、機器學習、安全、開源等新興領域提出建模的創新應用以及可持續性。官網鏈接： · 掩碼 · Prompt · 語言模型化 · Engineering ·

2024 年 7 月 8 日

Using Grammar Masking to Ensure Syntactic Validity in LLM-based Modeling Tasks

Lukas Netz,Jan Reimar,Bernhard Rumpe

from arxiv, Preprint to be published in the MODELS Workshop "MDE Intelligence"

We present and evaluate a method called grammar masking, which is used to guide large language models (LLMs) toward producing syntactically correct models for a given context-free grammar. Prompt engineering methods such as few-shot learning or priming can be used to improve the chances of an LLM producing correct syntax, but the more complex the grammar, the more time-consuming and less promising these methods become. Previous work is focused primarily on the usage of either language model training or prompt engineering. In this work, a method is presented that restricts the output to a given grammar using constrained decoding to ensure the output adheres to a valid syntax. We use several DSLs built with MontiCore and task multiple LLMs to produce models with and without constrained decoding. A corresponding parser is used to confirm the syntactic correctness of each model. We show that grammar masking can dramatically improve the modeling capabilities of several LLMs, reducing the need for well-refined prompting while increasing the chance of producing correct models.

劃分 · 優化器 · 近似 · CASE · Facebook AI Research ·

2024 年 7 月 8 日

Partitioning Problems with Splittings and Interval Targets

Samuel Bismuth,Vladislav Makarov,Erel Segal-Halevi,Dana Shapira

The $n$-way number partitioning problem is a classic problem in combinatorial optimization, with applications to diverse settings such as fair allocation and machine scheduling. All these problems are NP-hard, but various approximation algorithms are known. We consider three closely related kinds of approximations. The first two variants optimize the partition such that: in the first variant some fixed number $s$ of items can be \emph{split} between two or more bins and in the second variant we allow at most a fixed number $t$ of \emph{splittings}. The third variant is a decision problem: the largest bin sum must be within a pre-specified interval, parameterized by a fixed rational number $u$ times the largest item size. When the number of bins $n$ is unbounded, we show that every variant is strongly {\sf NP}-complete. When the number of bins $n$ is fixed, the running time depends on the fixed parameters $s,t,u$. For each variant, we give a complete picture of its running time. For $n=2$, the running time is easy to identify. Our main results consider any fixed integer $n \geq 3$. Using a two-way polynomial-time reduction between the first and the third variant, we show that $n$-way number-partitioning with $s$ split items can be solved in polynomial time if $s \geq n-2$, and it is {\sf NP}-complete otherwise. Also, $n$-way number-partitioning with $t$ splittings can be solved in polynomial time if $t \geq n-1$, and it is {\sf NP}-complete otherwise. Finally, we show that the third variant can be solved in polynomial time if $u \geq (n-2)/n$, and it is {\sf NP}-complete otherwise. Our positive results for the optimization problems consider both min-max and max-min versions. Using the same reduction, and we provide a fully polynomial-time approximation scheme for the case where the number of split items is lower than $n-2$.

contrastive · 圖 · Learning · GROUP · 圖卷積神經網絡/圖卷積網絡 ·

2024 年 7 月 6 日

Consistency and Discrepancy-Based Contrastive Tripartite Graph Learning for Recommendations

Linxin Guo,Yaochen Zhu,Min Gao,Yinghui Tao,Junliang Yu,Chen Chen

Tripartite graph-based recommender systems markedly diverge from traditional models by recommending unique combinations such as user groups and item bundles. Despite their effectiveness, these systems exacerbate the longstanding cold-start problem in traditional recommender systems, because any number of user groups or item bundles can be formed among users or items. To address this issue, we introduce a Consistency and Discrepancy-based graph contrastive learning method for tripartite graph-based Recommendation. This approach leverages two novel meta-path-based metrics consistency and discrepancy to capture nuanced, implicit associations between the recommended objects and the recommendees. These metrics, indicative of high-order similarities, can be efficiently calculated with infinite graph convolutional networks layers under a multi-objective optimization framework, using the limit theory of GCN.

Performer · 有向 · JFP · MoDELS · 通道 ·

2024 年 7 月 6 日

Flexible Antenna Arrays for Wireless Communications: Modeling and Performance Evaluation

Songjie Yang,Jiancheng An,Yue Xiu,Wanting Lyu,Boyu Ning,Zhongpei Zhang,Merouane Debbah,Chau Yuen

Flexible antenna arrays (FAAs), distinguished by their rotatable, bendable, and foldable properties, are extensively employed in flexible radio systems to achieve customized radiation patterns. This paper aims to illustrate that FAAs, capable of dynamically adjusting surface shapes, can enhance communication performances with both omni-directional and directional antenna patterns, in terms of multi-path channel power and channel angle Cram\'{e}r-Rao bounds. To this end, we develop a mathematical model that elucidates the impacts of the variations in antenna positions and orientations as the array transitions from a flat to a rotated, bent, and folded state, all contingent on the flexible degree-of-freedom. Moreover, since the array shape adjustment operates across the entire beamspace, especially with directional patterns, we discuss the sum-rate in the multi-sector base station that covers the $360^\circ$ communication area. Particularly, to thoroughly explore the multi-sector sum-rate, we propose separate flexible precoding (SFP), joint flexible precoding (JFP), and semi-joint flexible precoding (SJFP), respectively. In our numerical analysis comparing the optimized FAA to the fixed uniform planar array, we find that the bendable FAA achieves a remarkable $156\%$ sum-rate improvement compared to the fixed planar array in the case of JFP with the directional pattern. Furthermore, the rotatable FAA exhibits notably superior performance in SFP and SJFP cases with omni-directional patterns, with respective $35\%$ and $281\%$.

INTERACT · 潛在 · MoDELS · 控制器 · 講稿 ·

2024 年 7 月 5 日

A Mapping Strategy for Interacting with Latent Audio Synthesis Using Artistic Materials

Shuoyang Zheng,Anna Xambó Sedó,Nick Bryan-Kinns

This paper presents a mapping strategy for interacting with the latent spaces of generative AI models. Our approach involves using unsupervised feature learning to encode a human control space and mapping it to an audio synthesis model's latent space. To demonstrate how this mapping strategy can turn high-dimensional sensor data into control mechanisms of a deep generative model, we present a proof-of-concept system that uses visual sketches to control an audio synthesis model. We draw on emerging discourses in XAIxArts to discuss how this approach can contribute to XAI in artistic and creative contexts, we also discuss its current limitations and propose future research directions.

神經形態計算 · Next · 計算機體系架構 ·

2024 年 7 月 5 日

Roadmap to Neuromorphic Computing with Emerging Technologies

Adnan Mehonic,Daniele Ielmini,Kaushik Roy,Onur Mutlu,Shahar Kvatinsky,Teresa Serrano-Gotarredona,Bernabe Linares-Barranco,Sabina Spiga,Sergey Savelev,Alexander G Balanov,Nitin Chawla,Giuseppe Desoli,Gerardo Malavena,Christian Monzio Compagnoni,Zhongrui Wang,J Joshua Yang,Ghazi Sarwat Syed,Abu Sebastian,Thomas Mikolajick,Beatriz Noheda,Stefan Slesazeck,Bernard Dieny, Tuo-Hung, Hou,Akhil Varri,Frank Bruckerhoff-Pluckelmann,Wolfram Pernice,Xixiang Zhang,Sebastian Pazos,Mario Lanza,Stefan Wiefels,Regina Dittmann,Wing H Ng,Mark Buckwell,Horatio RJ Cox,Daniel J Mannion,Anthony J Kenyon,Yingming Lu,Yuchao Yang,Damien Querlioz,Louis Hutin,Elisa Vianello,Sayeed Shafayet Chowdhury,Piergiulio Mannocci,Yimao Cai,Zhong Sun,Giacomo Pedretti,John Paul Strachan,Dmitri Strukov,Manuel Le Gallo,Stefano Ambrogio,Ilia Valov,Rainer Waser

from arxiv, 90 pages, 22 figures, roadmap, neuromorphic

The roadmap is organized into several thematic sections, outlining current computing challenges, discussing the neuromorphic computing approach, analyzing mature and currently utilized technologies, providing an overview of emerging technologies, addressing material challenges, exploring novel computing concepts, and finally examining the maturity level of emerging technologies while determining the next essential steps for their advancement.

MoDELS · 語言模型化 · Learning · 逼真度 · 講稿 ·

2024 年 4 月 11 日

Best Practices and Lessons Learned on Synthetic Data for Language Models

Ruibo Liu,Jerry Wei,Fangyu Liu,Chenglei Si,Yanzhe Zhang,Jinmeng Rao,Steven Zheng,Daiyi Peng,Diyi Yang,Denny Zhou,Andrew M. Dai

The success of AI models relies on the availability of large, diverse, and high-quality datasets, which can be challenging to obtain due to data scarcity, privacy concerns, and high costs. Synthetic data has emerged as a promising solution by generating artificial data that mimics real-world patterns. This paper provides an overview of synthetic data research, discussing its applications, challenges, and future directions. We present empirical evidence from prior art to demonstrate its effectiveness and highlight the importance of ensuring its factuality, fidelity, and unbiasedness. We emphasize the need for responsible use of synthetic data to build more powerful, inclusive, and trustworthy language models.

示例 · 端到端 · 變換 · MoDELS · 可理解性 ·

2021 年 3 月 24 日

End-to-End Video Instance Segmentation with Transformers

Yuqing Wang,Zhaoliang Xu,Xinlong Wang,Chunhua Shen,Baoshan Cheng,Hao Shen,Huaxia Xia

from arxiv, CVPR2021 Oral

Video instance segmentation (VIS) is the task that requires simultaneously classifying, segmenting and tracking object instances of interest in video. Recent methods typically develop sophisticated pipelines to tackle this task. Here, we propose a new video instance segmentation framework built upon Transformers, termed VisTR, which views the VIS task as a direct end-to-end parallel sequence decoding/prediction problem. Given a video clip consisting of multiple image frames as input, VisTR outputs the sequence of masks for each instance in the video in order directly. At the core is a new, effective instance sequence matching and segmentation strategy, which supervises and segments instances at the sequence level as a whole. VisTR frames the instance segmentation and tracking in the same perspective of similarity learning, thus considerably simplifying the overall pipeline and is significantly different from existing approaches. Without bells and whistles, VisTR achieves the highest speed among all existing VIS models, and achieves the best result among methods using single model on the YouTube-VIS dataset. For the first time, we demonstrate a much simpler and faster video instance segmentation framework built upon Transformers, achieving competitive accuracy. We hope that VisTR can motivate future research for more video understanding tasks.

事件抽取 · 學成 · 逆強化學習 · GAN · 估計/估計量 ·

2018 年 4 月 21 日

Event Extraction with Generative Adversarial Imitation Learning

Tongtao Zhang,Heng Ji

We propose a new method for event extraction (EE) task based on an imitation learning framework, specifically, inverse reinforcement learning (IRL) via generative adversarial network (GAN). The GAN estimates proper rewards according to the difference between the actions committed by the expert (or ground truth) and the agent among complicated states in the environment. EE task benefits from these dynamic rewards because instances and labels yield to various extents of difficulty and the gains are expected to be diverse -- e.g., an ambiguous but correctly detected trigger or argument should receive high gains -- while the traditional RL models usually neglect such differences and pay equal attention on all instances. Moreover, our experiments also demonstrate that the proposed framework outperforms state-of-the-art methods, without explicit feature engineering.

平滑 · 注意力機制 · 反向傳播 · 維特比算法 · 正則化項 ·

2018 年 2 月 20 日

Differentiable Dynamic Programming for Structured Prediction and Attention

Arthur Mensch,Mathieu Blondel

Dynamic programming (DP) solves a variety of structured combinatorial problems by iteratively breaking them down into smaller subproblems. In spite of their versatility, DP algorithms are usually non-differentiable, which hampers their use as a layer in neural networks trained by backpropagation. To address this issue, we propose to smooth the max operator in the dynamic programming recursion, using a strongly convex regularizer. This allows to relax both the optimal value and solution of the original combinatorial problem, and turns a broad class of DP algorithms into differentiable operators. Theoretically, we provide a new probabilistic perspective on backpropagating through these DP operators, and relate them to inference in graphical models. We derive two particular instantiations of our framework, a smoothed Viterbi algorithm for sequence prediction and a smoothed DTW algorithm for time-series alignment. We showcase these instantiations on two structured prediction tasks and on structured and sparse attention for neural machine translation.