久久久久精品电影,久久久一卡二卡三卡四卡

Arbitrary, inconsistent, or faulty decision-making raises serious concerns, and preventing unfair models is an increasingly important challenge in Machine Learning. Data often reflect past discriminatory behavior, and models trained on such data may reflect bias on sensitive attributes, such as gender, race, or age. One approach to developing fair models is to preprocess the training data to remove the underlying biases while preserving the relevant information, for example, by correcting biased labels. While multiple label noise correction methods are available, the information about their behavior in identifying discrimination is very limited. In this work, we develop an empirical methodology to systematically evaluate the effectiveness of label noise correction techniques in ensuring the fairness of models trained on biased datasets. Our methodology involves manipulating the amount of label noise and can be used with fairness benchmarks but also with standard ML datasets. We apply the methodology to analyze six label noise correction methods according to several fairness metrics on standard OpenML datasets. Our results suggest that the Hybrid Label Noise Correction method achieves the best trade-off between predictive performance and fairness. Clustering-Based Correction can reduce discrimination the most, however, at the cost of lower predictive performance.

相關內容

Facebook AI Research

關注 10

MoDELS · Performer · 語言模型化 · 可理解性 · 有向 ·

2023 年 8 月 21 日

Large Linguistic Models: Analyzing theoretical linguistic abilities of LLMs

Ga?per Begu?,Maksymilian D?bkowski,Ryan Rhodes

The performance of large language models (LLMs) has recently improved to the point where the models can perform well on many language tasks. We show here that for the first time, the models can also generate coherent and valid formal analyses of linguistic data and illustrate the vast potential of large language models for analyses of their metalinguistic abilities. LLMs are primarily trained on language data in the form of text; analyzing and evaluating their metalinguistic abilities improves our understanding of their general capabilities and sheds new light on theoretical models in linguistics. In this paper, we probe into GPT-4's metalinguistic capabilities by focusing on three subfields of formal linguistics: syntax, phonology, and semantics. We outline a research program for metalinguistic analyses of large language models, propose experimental designs, provide general guidelines, discuss limitations, and offer future directions for this line of research. This line of inquiry also exemplifies behavioral interpretability of deep learning, where models' representations are accessed by explicit prompting rather than internal representations.

有限差分 · Ghost（博客程序） · MoDELS · 離散化 · 值域 ·

2023 年 8 月 21 日

Instabilities of explicit finite difference schemes with ghost points on the diffusion equation

Fabien Le Floc'h

Ghost, or fictitious points allow to capture boundary conditions that are not located on the finite difference grid discretization. We explore in this paper the impact of ghost points on the stability of the explicit Euler finite difference scheme in the context of the diffusion equation. In particular, we consider the case of a one-touch option under the Black-Scholes model. The observations and results are however valid for a much wider range of financial contracts and models.

Automator · 講稿 · 模態 · 人工智能 · 符號學 ·

2023 年 8 月 21 日

Normative conditional reasoning as a fragment of HOL

Xavier Parenta,Christoph Benzmüller

from arxiv, 22 pages, 28 figures, 3 tables

We report some results regarding the mechanization of normative (preference-based) conditional reasoning. Our focus is on Aqvist's system E for conditional obligation (and its extensions). Our mechanization is achieved via a shallow semantical embedding in Isabelle/HOL. We consider two possible uses of the framework. The first one is as a tool for meta-reasoning about the considered logic. We employ it for the automated verification of deontic correspondences (broadly conceived) and related matters, analogous to what has been previously achieved for the modal logic cube. The second use is as a tool for assessing ethical arguments. We provide a computer encoding of a well-known paradox in population ethics, Parfit's repugnant conclusion. Whether the presented encoding increases or decreases the attractiveness and persuasiveness of the repugnant conclusion is a question we would like to pass on to philosophy and ethics.

MoDELS · Analysis · 有向 · 泛函 · Nuance ·

2023 年 8 月 20 日

A unified approach to radial, hyperbolic, and directional efficiency measurement in Data Envelopment Analysis

Margaréta Halická,Mária Trnovská,Ale? ?erny

from arxiv, 36 pages

The paper analyses properties of a large class of "path-based" Data Envelopment Analysis models through a unifying general scheme. The scheme includes the well-known oriented radial models, the hyperbolic distance function model, the directional distance function models, and even permits their generalisations. The modelling is not constrained to non-negative data and is flexible enough to accommodate variants of standard models over arbitrary data. Mathematical tools developed in the paper allow systematic analysis of the models from the point of view of ten desirable properties. It is shown that some of the properties are satisfied (resp., fail) for all models in the general scheme, while others have a more nuanced behaviour and must be assessed individually in each model. Our results can help researchers and practitioners navigate among the different models and apply the models to mixed data.

Extensibility · 控制器 · Performer · MoDELS · 多樣性 ·

2023 年 8 月 19 日

ControlRetriever: Harnessing the Power of Instructions for Controllable Retrieval

Kaihang Pan,Juncheng Li,Hongye Song,Hao Fei,Wei Ji,Shuo Zhang,Jun Lin,Xiaozhong Liu,Siliang Tang

Recent studies have shown that dense retrieval models, lacking dedicated training data, struggle to perform well across diverse retrieval tasks, as different retrieval tasks often entail distinct search intents. To address this challenge, in this work we introduce ControlRetriever, a generic and efficient approach with a parameter isolated architecture, capable of controlling dense retrieval models to directly perform varied retrieval tasks, harnessing the power of instructions that explicitly describe retrieval intents in natural language. Leveraging the foundation of ControlNet, which has proven powerful in text-to-image generation, ControlRetriever imbues different retrieval models with the new capacity of controllable retrieval, all while being guided by task-specific instructions. Furthermore, we propose a novel LLM guided Instruction Synthesizing and Iterative Training strategy, which iteratively tunes ControlRetriever based on extensive automatically-generated retrieval data with diverse instructions by capitalizing the advancement of large language models. Extensive experiments show that in the BEIR benchmark, with only natural language descriptions of specific retrieval intent for each task, ControlRetriever, as a unified multi-task retrieval system without task-specific tuning, significantly outperforms baseline methods designed with task-specific retrievers and also achieves state-of-the-art zero-shot performance.

Extensibility · 奇異的 · 正定 · 類別 · 數值分析 ·

2023 年 8 月 19 日

The extension of Weyl-type relative perturbation bounds

Haoyuan Ma

from arxiv, 37 pages

Relative perturbation theory for eigenvalues of Hermitian positive definite matrices has been well-studied, and the major results were later derived analogously for Hermitian non-singular matrices. In this dissertation we extend several relative perturbation results to Hermitian matrices that are potentially singular, and also develop a general class of relative bounds for Hermitian matrices. As a result, corresponding relative bounds for singular values of rank-deficient $m\times n$ matrices are also obtained using related Jordan-Wielandt matrices. We also discuss a comparison between the main relative bound derived and the Weyl's absolute perturbation bound in terms of their sharpness and derivation in practice.

神經元 · Pattern Recognition · Networking · 人工神經元 · Neural Networks ·

2023 年 8 月 17 日

Pattern recognition using spiking antiferromagnetic neurons

Hannah Bradley,Steven Louis,Andrei Slavin,Vasyl Tyberkevych

Spintronic devices offer a promising avenue for the development of nanoscale, energy-efficient artificial neurons for neuromorphic computing. It has previously been shown that with antiferromagnetic (AFM) oscillators, ultra-fast spiking artificial neurons can be made that mimic many unique features of biological neurons. In this work, we train an artificial neural network of AFM neurons to perform pattern recognition. A simple machine learning algorithm called spike pattern association neuron (SPAN), which relies on the temporal position of neuron spikes, is used during training. In under a microsecond of physical time, the AFM neural network is trained to recognize symbols composed from a grid by producing a spike within a specified time window. We further achieve multi-symbol recognition with the addition of an output layer to suppress undesirable spikes. Through the utilization of AFM neurons and the SPAN algorithm, we create a neural network capable of high-accuracy recognition with overall power consumption on the order of picojoules.

Better · CoT · 小樣本學習 · 匯聚 · Performer ·

2023 年 8 月 16 日

Better patching using LLM prompting, via Self-Consistency

Toufique Ahmed,Premkumar Devanbu

from arxiv, Accepted at ASE-NIER (2023) track

Large Language models (LLMs) can be induced to solve non-trivial problems with "few-shot" prompts including illustrative problem-solution examples. Now if the few-shots also include "chain of thought" (CoT) explanations, which are of the form problem-explanation-solution, LLMs will generate a "explained" solution, and perform even better. Recently an exciting, substantially better technique, self-consistency [1] (S-C) has emerged, based on the intuition that there are many plausible explanations for the right solution; when the LLM is sampled repeatedly to generate a pool of explanation-solution pairs, for a given problem, the most frequently occurring solutions in the pool (ignoring the explanations) tend to be even more likely to be correct! Unfortunately, the use of this highly-performant S-C (or even CoT) approach in software engineering settings is hampered by the lack of explanations; most software datasets lack explanations. In this paper, we describe an application of the S-C approach to program repair, using the commit log on the fix as the explanation, only in the illustrative few-shots. We achieve state-of-the art results, beating previous approaches to prompting-based program repair, on the MODIT dataset; we also find evidence suggesting that the correct commit messages are helping the LLM learn to produce better patches.

MoDELS · 控制器 · AI · Performer · 可理解性 ·

2023 年 8 月 16 日

Artistic control over the glitch in AI-generated motion capture

Jamal Knight,Andrew Johnston,Adam Berry

Artificial intelligence (AI) models are prevalent today and provide a valuable tool for artists. However, a lesser-known artifact that comes with AI models that is not always discussed is the glitch. Glitches occur for various reasons; sometimes, they are known, and sometimes they are a mystery. Artists who use AI models to generate art might not understand the reason for the glitch but often want to experiment and explore novel ways of augmenting the output of the glitch. This paper discusses some of the questions artists have when leveraging the glitch in AI art production. It explores the unexpected positive outcomes produced by glitches in the specific context of motion capture and performance art.

INFORMS · 話題 · 自然語言處理 · 多媒體 · 進化計算 ·

2021 年 9 月 11 日

A Survey on Multi-modal Summarization

Anubhav Jangra,Adam Jatowt,Sriparna Saha,Mohammad Hasanuzzaman

The new era of technology has brought us to the point where it is convenient for people to share their opinions over an abundance of platforms. These platforms have a provision for the users to express themselves in multiple forms of representations, including text, images, videos, and audio. This, however, makes it difficult for users to obtain all the key information about a topic, making the task of automatic multi-modal summarization (MMS) essential. In this paper, we present a comprehensive survey of the existing research in the area of MMS.