国产日黄色大片一区二区_91人妻社区论坛精选_国产免费观看视频_成人高清视频在线_欧美另类在线观看完整版_精品日韩欧美一区夜夜嗨_国产成人TV在线观看

This document provides the annotation guidelines for MaiBaam, a Bavarian corpus manually annotated with part-of-speech (POS) tags, syntactic dependencies, and German lemmas. MaiBaam belongs to the Universal Dependencies (UD) project, and our annotations elaborate on the general and German UD version 2 guidelines. In this document, we detail how to preprocess and tokenize Bavarian data, provide an overview of the POS tags and dependencies we use, explain annotation decisions that would also apply to closely related languages like German, and lastly we introduce and motivate decisions that are specific to Bavarian grammar.

相關內容

詞性標注

關注 389

詞(ci)性（part-of-speech）是詞(ci)匯基本的(de)語法(fa)(fa)屬性，通常也稱(cheng)為(wei)詞(ci)類(lei)。詞(ci)性標(biao)(biao)注(zhu)(zhu)就(jiu)是在(zai)給定(ding)句(ju)子中(zhong)判(pan)定(ding)每個詞(ci)的(de)語法(fa)(fa)范(fan)疇(chou)，確定(ding)其(qi)詞(ci)性并(bing)加以標(biao)(biao)注(zhu)(zhu)的(de)過程，是中(zhong)文(wen)信息處理面(mian)臨的(de)重要基礎性問(wen)題。在(zai)語料(liao)庫(ku)語言學中(zhong)，詞(ci)性標(biao)(biao)注(zhu)(zhu)（POS標(biao)(biao)注(zhu)(zhu)或PoS標(biao)(biao)注(zhu)(zhu)或POST），也稱(cheng)為(wei)語法(fa)(fa)標(biao)(biao)注(zhu)(zhu)，是將文(wen)本（語料(liao)庫(ku)）中(zhong)的(de)單詞(ci)標(biao)(biao)注(zhu)(zhu)為(wei)與特定(ding)詞(ci)性相(xiang)對應的(de)過程，[1] 基于(yu)其(qi)定(ding)義和(he)上下文(wen)。

多樣性 · 得分 · 蒸餾 · 3D · 優化器 ·

2024 年 12 月 9 日

Diverse Score Distillation

Yanbo Xu,Jayanth Srinivasa,Gaowen Liu,Shubham Tulsiani

from arxiv, Project Page: //billyxyb.github.io/Diverse-Score-Distillation/

Score distillation of 2D diffusion models has proven to be a powerful mechanism to guide 3D optimization, for example enabling text-based 3D generation or single-view reconstruction. A common limitation of existing score distillation formulations, however, is that the outputs of the (mode-seeking) optimization are limited in diversity despite the underlying diffusion model being capable of generating diverse samples. In this work, inspired by the sampling process in denoising diffusion, we propose a score formulation that guides the optimization to follow generation paths defined by random initial seeds, thus ensuring diversity. We then present an approximation to adopt this formulation for scenarios where the optimization may not precisely follow the generation paths (e.g. a 3D representation whose renderings evolve in a co-dependent manner). We showcase the applications of our `Diverse Score Distillation' (DSD) formulation across tasks such as 2D optimization, text-based 3D inference, and single-view reconstruction. We also empirically validate DSD against prior score distillation formulations and show that it significantly improves sample diversity while preserving fidelity.

Neural Networks · MoDELS · Networking · 模型評估 · 有限差分 ·

2024 年 12 月 6 日

Differentiable Weightless Neural Networks

Alan T. L. Bacellar,Zachary Susskind,Mauricio Breternitz Jr.,Eugene John,Lizy K. John,Priscila M. V. Lima,Felipe M. G. Fran?a

We introduce the Differentiable Weightless Neural Network (DWN), a model based on interconnected lookup tables. Training of DWNs is enabled by a novel Extended Finite Difference technique for approximate differentiation of binary values. We propose Learnable Mapping, Learnable Reduction, and Spectral Regularization to further improve the accuracy and efficiency of these models. We evaluate DWNs in three edge computing contexts: (1) an FPGA-based hardware accelerator, where they demonstrate superior latency, throughput, energy efficiency, and model area compared to state-of-the-art solutions, (2) a low-power microcontroller, where they achieve preferable accuracy to XGBoost while subject to stringent memory constraints, and (3) ultra-low-cost chips, where they consistently outperform small models in both accuracy and projected hardware area. DWNs also compare favorably against leading approaches for tabular datasets, with higher average rank. Overall, our work positions DWNs as a pioneering solution for edge-compatible high-throughput neural networks.

Automator · Engineering · Performer · 原點 · 服務器 ·

2024 年 12 月 5 日

Federated Automated Feature Engineering

Tom Overman,Diego Klabjan

from arxiv, Preliminary Work

Automated feature engineering (AutoFE) is used to automatically create new features from original features to improve predictive performance without needing significant human intervention and expertise. Many algorithms exist for AutoFE, but very few approaches exist for the federated learning (FL) setting where data is gathered across many clients and is not shared between clients or a central server. We introduce AutoFE algorithms for the horizontal, vertical, and hybrid FL settings, which differ in how the data is gathered across clients. To the best of our knowledge, we are the first to develop AutoFE algorithms for the horizontal and hybrid FL cases, and we show that the downstream model performance of federated AutoFE is similar to the case where data is held centrally and AutoFE is performed centrally.

CASE · 閾值 · 可約的 · 評論員 · 講稿 ·

2024 年 12 月 4 日

Refining Concentration for Gaussian Quadratic Chaos

Kamyar Moshksar

We visit and slightly modify the proof of Hanson-Wright inequality (HW inequality) for concentration of Gaussian quadratic chaos where we are able to tighten the bound by increasing the absolute constant in its formulation from its largest currently known value of 0.125 to at least 0.145 in the symmetric case. We also present a sharper version of the so-called Laurent-Massart inequality (LM inequality) through which we are able to increase the absolute constant in HW inequality from its largest currently available value of 0.134 due to LM inequality itself to at least 0.152 in the positive-semidefinite case. Generalizing HW inequality in the symmetric case, we derive a sequence of concentration bounds for Gaussian quadratic chaos indexed over m = 1, 2, 3,... that involves the Schatten norms of the underlying matrix. The case m = 1 reduces to HW inequality. These bounds exhibit a phase transition in behaviour in the sense that m = 1 results in the tightest bound if the deviation is smaller than a critical threshold and the bounds keep getting tighter as the index m increases when the deviation is larger than the aforementioned threshold. Finally, we derive a concentration bound that is asymptotically tighter than HW inequality both in the small and large deviation regimes.

INTERACT · INFORMS · Processing（編程語言） · 語言模型化 · 回合 ·

2023 年 5 月 22 日

Interactive Natural Language Processing

Zekun Wang,Ge Zhang,Kexin Yang,Ning Shi,Wangchunshu Zhou,Shaochun Hao,Guangzheng Xiong,Yizhi Li,Mong Yuan Sim,Xiuying Chen,Qingqing Zhu,Zhenzhu Yang,Adam Nik,Qi Liu,Chenghua Lin,Shi Wang,Ruibo Liu,Wenhu Chen,Ke Xu,Dayiheng Liu,Yike Guo,Jie Fu

from arxiv, 110 pages

Interactive Natural Language Processing (iNLP) has emerged as a novel paradigm within the field of NLP, aimed at addressing limitations in existing frameworks while aligning with the ultimate goals of artificial intelligence. This paradigm considers language models as agents capable of observing, acting, and receiving feedback iteratively from external entities. Specifically, language models in this context can: (1) interact with humans for better understanding and addressing user needs, personalizing responses, aligning with human values, and improving the overall user experience; (2) interact with knowledge bases for enriching language representations with factual knowledge, enhancing the contextual relevance of responses, and dynamically leveraging external information to generate more accurate and informed responses; (3) interact with models and tools for effectively decomposing and addressing complex tasks, leveraging specialized expertise for specific subtasks, and fostering the simulation of social behaviors; and (4) interact with environments for learning grounded representations of language, and effectively tackling embodied tasks such as reasoning, planning, and decision-making in response to environmental observations. This paper offers a comprehensive survey of iNLP, starting by proposing a unified definition and framework of the concept. We then provide a systematic classification of iNLP, dissecting its various components, including interactive objects, interaction interfaces, and interaction methods. We proceed to delve into the evaluation methodologies used in the field, explore its diverse applications, scrutinize its ethical and safety issues, and discuss prospective research directions. This survey serves as an entry point for researchers who are interested in this rapidly evolving area and offers a broad view of the current landscape and future trajectory of iNLP.

Learning · Processing（編程語言） · MoDELS · 分解的 · 表示學習 ·

2022 年 11 月 21 日

Disentangled Representation Learning

Xin Wang,Hong Chen,Si'ao Tang,Zihao Wu,Wenwu Zhu

from arxiv, 22 pages,9 figures

Disentangled Representation Learning (DRL) aims to learn a model capable of identifying and disentangling the underlying factors hidden in the observable data in representation form. The process of separating underlying factors of variation into variables with semantic meaning benefits in learning explainable representations of data, which imitates the meaningful understanding process of humans when observing an object or relation. As a general learning strategy, DRL has demonstrated its power in improving the model explainability, controlability, robustness, as well as generalization capacity in a wide range of scenarios such as computer vision, natural language processing, data mining etc. In this article, we comprehensively review DRL from various aspects including motivations, definitions, methodologies, evaluations, applications and model designs. We discuss works on DRL based on two well-recognized definitions, i.e., Intuitive Definition and Group Theory Definition. We further categorize the methodologies for DRL into four groups, i.e., Traditional Statistical Approaches, Variational Auto-encoder Based Approaches, Generative Adversarial Networks Based Approaches, Hierarchical Approaches and Other Approaches. We also analyze principles to design different DRL models that may benefit different tasks in practical applications. Finally, we point out challenges in DRL as well as potential research directions deserving future investigations. We believe this work may provide insights for promoting the DRL research in the community.

次最優 · ML · 極小點 · state-of-the-art · MoDELS ·

2020 年 12 月 10 日

Composite Adversarial Attacks

Xiaofeng Mao,Yuefeng Chen,Shuhui Wang,Hang Su,Yuan He,Hui Xue

from arxiv, To appear in AAAI 2021, code will be released later

Adversarial attack is a technique for deceiving Machine Learning (ML) models, which provides a way to evaluate the adversarial robustness. In practice, attack algorithms are artificially selected and tuned by human experts to break a ML system. However, manual selection of attackers tends to be sub-optimal, leading to a mistakenly assessment of model security. In this paper, a new procedure called Composite Adversarial Attack (CAA) is proposed for automatically searching the best combination of attack algorithms and their hyper-parameters from a candidate pool of \textbf{32 base attackers}. We design a search space where attack policy is represented as an attacking sequence, i.e., the output of the previous attacker is used as the initialization input for successors. Multi-objective NSGA-II genetic algorithm is adopted for finding the strongest attack policy with minimum complexity. The experimental result shows CAA beats 10 top attackers on 11 diverse defenses with less elapsed time (\textbf{6 $\times$ faster than AutoAttack}), and achieves the new state-of-the-art on $l_{\infty}$, $l_{2}$ and unrestricted adversarial attacks.

置信度 · MoDELS · Extensibility · 圖 · entity ·

2019 年 2 月 26 日

Embedding Uncertain Knowledge Graphs

Xuelu Chen,Muhao Chen,Weijia Shi,Yizhou Sun,Carlo Zaniolo

Embedding models for deterministic Knowledge Graphs (KG) have been extensively studied, with the purpose of capturing latent semantic relations between entities and incorporating the structured knowledge into machine learning. However, there are many KGs that model uncertain knowledge, which typically model the inherent uncertainty of relations facts with a confidence score, and embedding such uncertain knowledge represents an unresolved challenge. The capturing of uncertain knowledge will benefit many knowledge-driven applications such as question answering and semantic search by providing more natural characterization of the knowledge. In this paper, we propose a novel uncertain KG embedding model UKGE, which aims to preserve both structural and uncertainty information of relation facts in the embedding space. Unlike previous models that characterize relation facts with binary classification techniques, UKGE learns embeddings according to the confidence scores of uncertain relation facts. To further enhance the precision of UKGE, we also introduce probabilistic soft logic to infer confidence scores for unseen relation facts during training. We propose and evaluate two variants of UKGE based on different learning objectives. Experiments are conducted on three real-world uncertain KGs via three tasks, i.e. confidence prediction, relation fact ranking, and relation fact classification. UKGE shows effectiveness in capturing uncertain knowledge by achieving promising results on these tasks, and consistently outperforms baselines on these tasks.

圖卷積神經網絡/圖卷積網絡 · 圖卷積 · 圖 · Networking · 可約的 ·

2019 年 2 月 19 日

Simplifying Graph Convolutional Networks

Felix Wu,Tianyi Zhang,Amauri Holanda de Souza Jr.,Christopher Fifty,Tao Yu,Kilian Q. Weinberger

from arxiv, Code available at //github.com/Tiiiger/SGC

Graph Convolutional Networks (GCNs) and their variants have experienced significant attention and have become the de facto methods for learning graph representations. GCNs derive inspiration primarily from recent deep learning approaches, and as a result, may inherit unnecessary complexity and redundant computation. In this paper, we reduce this excess complexity through successively removing nonlinearities and collapsing weight matrices between consecutive layers. We theoretically analyze the resulting linear model and show that it corresponds to a fixed low-pass filter followed by a linear classifier. Notably, our experimental evaluation demonstrates that these simplifications do not negatively impact accuracy in many downstream applications. Moreover, the resulting model scales to larger datasets, is naturally interpretable, and yields up to two orders of magnitude speedup over FastGCN.

模式崩潰 · 對抗自編碼 · 自編碼器 · 峰值 · Better ·

2018 年 3 月 23 日

Generative Adversarial Autoencoder Networks

Ngoc-Trung Tran,Tuan-Anh Bui,Ngai-Man Cheung

We introduce an effective model to overcome the problem of mode collapse when training Generative Adversarial Networks (GAN). Firstly, we propose a new generator objective that finds it better to tackle mode collapse. And, we apply an independent Autoencoders (AE) to constrain the generator and consider its reconstructed samples as "real" samples to slow down the convergence of discriminator that enables to reduce the gradient vanishing problem and stabilize the model. Secondly, from mappings between latent and data spaces provided by AE, we further regularize AE by the relative distance between the latent and data samples to explicitly prevent the generator falling into mode collapse setting. This idea comes when we find a new way to visualize the mode collapse on MNIST dataset. To the best of our knowledge, our method is the first to propose and apply successfully the relative distance of latent and data samples for stabilizing GAN. Thirdly, our proposed model, namely Generative Adversarial Autoencoder Networks (GAAN), is stable and has suffered from neither gradient vanishing nor mode collapse issues, as empirically demonstrated on synthetic, MNIST, MNIST-1K, CelebA and CIFAR-10 datasets. Experimental results show that our method can approximate well multi-modal distribution and achieve better results than state-of-the-art methods on these benchmark datasets. Our model implementation is published here: //github.com/tntrung/gaan