The advent of Generative AI has marked a significant milestone in artificial intelligence, demonstrating remarkable capabilities in generating realistic images, texts, and data patterns. However, these advancements come with heightened concerns over data privacy and copyright infringement, primarily due to the reliance on vast datasets for model training. Traditional approaches like differential privacy, machine unlearning, and data poisoning offer only fragmented solutions to these complex issues. Our paper delves into the multifaceted challenges of privacy and copyright protection within the data lifecycle. We advocate for integrated approaches that combine technical innovation with ethical foresight, holistically addressing these concerns by investigating and devising solutions that are informed by the lifecycle perspective. This work aims to catalyze a broader discussion and inspire concerted efforts towards data privacy and copyright integrity in Generative AI.

Related content

Generative AI

Generative AI is an artificial intelligence technology that uses complex algorithms, models, and rules to learn from large-scale datasets and create new, original content. It can produce many types of content, including text, images, audio, video, and code, going well beyond the data processing and analysis capabilities of traditional software. In late 2022, OpenAI's release of ChatGPT marked significant progress for the technology in text generation, and 2023 has been called the breakthrough year for generative AI. The technology is moving quickly from pure language generation toward multimodal and embodied systems. In image generation, systems have made notable progress in interpreting prompts and producing realistic output. Video and audio generation are also advancing rapidly, opening new paths toward virtual reality and the metaverse. Generative AI has broad application prospects across industries and fields.

Networking · Neural Networks · Ground truth · Discretization · Dynamical systems ·

February 23, 2024

Studying the Impact of Stochasticity on the Evaluation of Deep Neural Networks for Forest-Fire Prediction

Harshit Kumar,Biswadeep Chakraborty,Beomseok Kang,Saibal Mukhopadhyay

from arxiv, Initial draft submitted to KDD 2024

This paper presents the first systematic study of the evaluation of Deep Neural Networks (DNNs) for discrete dynamical systems under stochastic assumptions, with a focus on wildfire prediction. We develop a framework to study the impact of stochasticity on two classes of evaluation metrics: classification-based metrics, which assess fidelity to observed ground truth (GT), and proper scoring rules, which test fidelity-to-statistic. Our findings reveal that evaluating for fidelity-to-statistic is a reliable alternative in highly stochastic scenarios. We extend our analysis to real-world wildfire data, highlighting limitations in traditional wildfire prediction evaluation methods, and suggest interpretable stochasticity-compatible alternatives.
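
The contrast between classification-based metrics and proper scoring rules can be illustrated with a small sketch. This is not the paper's framework: it assumes a single cell that ignites with a fixed probability `true_p`, uses the Brier score as one example of a proper scoring rule, and all function names are hypothetical.

```python
def brier_score(probs, outcomes):
    """Mean squared difference between forecast probabilities and 0/1 outcomes."""
    return sum((p - y) ** 2 for p, y in zip(probs, outcomes)) / len(probs)


def accuracy(probs, outcomes, threshold=0.5):
    """Classification accuracy after thresholding the forecast probabilities."""
    preds = [1 if p >= threshold else 0 for p in probs]
    return sum(int(a == b) for a, b in zip(preds, outcomes)) / len(preds)


def expected_brier(forecast, true_p):
    """Expected Brier score of a constant forecast when the outcome is Bernoulli(true_p)."""
    return true_p * (forecast - 1.0) ** 2 + (1.0 - true_p) * forecast ** 2


# One observed realization of a stochastic process (assumed data, for illustration).
observed = [0, 1, 0, 0]
calibrated = [0.3] * 4      # forecasts some ignition risk
overconfident = [0.0] * 4   # forecasts no risk at all

# Thresholded accuracy cannot tell the two forecasts apart on this realization.
same_accuracy = accuracy(calibrated, observed) == accuracy(overconfident, observed)

# The Brier score separates them, and in expectation it is minimized by the true probability.
better_brier = brier_score(calibrated, observed) < brier_score(overconfident, observed)
```

Because both forecasts fall below the 0.5 threshold, they produce identical hard predictions, so accuracy against a single observed ground truth rewards them equally. The expected Brier score is lowest when the forecast equals the underlying ignition probability, which is the sense in which a proper scoring rule tests fidelity to the statistic rather than to one realization.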

Entity · Entity alignment · Language modeling · Large language models · Model evaluation ·

February 23, 2024

Unlocking the Power of Large Language Models for Entity Alignment

Xuhui Jiang,Yinghan Shen,Zhichao Shi,Chengjin Xu,Wei Li,Zixuan Li,Jian Guo,Huawei Shen,Yuanzhuo Wang

Entity Alignment (EA) is vital for integrating diverse knowledge graph (KG) data, playing a crucial role in data-driven AI applications. Traditional EA methods primarily rely on comparing entity embeddings, but their effectiveness is constrained by the limited input KG data and the capabilities of the representation learning techniques. Against this backdrop, we introduce ChatEA, an innovative framework that incorporates large language models (LLMs) to improve EA. To address the constraints of limited input KG data, ChatEA introduces a KG-code translation module that translates KG structures into a format understandable by LLMs, thereby allowing LLMs to utilize their extensive background knowledge to improve EA accuracy. To overcome the over-reliance on entity embedding comparisons, ChatEA implements a two-stage EA strategy that capitalizes on LLMs' capability for multi-step reasoning in a dialogue format, thereby enhancing accuracy while preserving efficiency. Our experimental results affirm ChatEA's superior performance, highlighting LLMs' potential in facilitating EA tasks.

Diversity · Homogeneity · Nuance · Mutually independent · Reducible ·

February 22, 2024

Filter Bubble or Homogenization? Disentangling the Long-Term Effects of Recommendations on User Consumption Patterns

Md Sanzeed Anwar,Grant Schoenebeck,Paramveer S. Dhillon

from arxiv, This paper was accepted at the ACM Web Conference 2024 (WWW '24)

Recommendation algorithms play a pivotal role in shaping our media choices, which makes it crucial to comprehend their long-term impact on user behavior. These algorithms are often linked to two critical outcomes: homogenization, wherein users consume similar content despite disparate underlying preferences, and the filter bubble effect, wherein individuals with differing preferences only consume content aligned with their preferences (without much overlap with other users). Prior research assumes a trade-off between homogenization and filter bubble effects and then shows that personalized recommendations mitigate filter bubbles by fostering homogenization. However, because of this assumption of a tradeoff between these two effects, prior work cannot develop a more nuanced view of how recommendation systems may independently impact homogenization and filter bubble effects. We develop a more refined definition of homogenization and the filter bubble effect by decomposing them into two key metrics: how different the average consumption is between users (inter-user diversity) and how varied an individual's consumption is (intra-user diversity). We then use a novel agent-based simulation framework that enables a holistic view of the impact of recommendation systems on homogenization and filter bubble effects. Our simulations show that traditional recommendation algorithms (based on past behavior) mainly reduce filter bubbles by affecting inter-user diversity without significantly impacting intra-user diversity. Building on these findings, we introduce two new recommendation algorithms that take a more nuanced approach by accounting for both types of diversity.
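
The two metrics can be sketched in a few lines. The abstract does not give exact formulas, so the definitions below are assumptions chosen for illustration: each item is a point in a feature space, intra-user diversity is the mean pairwise distance among one user's items, and inter-user diversity is the mean pairwise distance between users' average consumption vectors. All names are hypothetical.

```python
from itertools import combinations
from math import dist


def mean_vector(items):
    """Coordinate-wise mean of a list of equal-length item vectors."""
    n = len(items)
    return [sum(column) / n for column in zip(*items)]


def mean_pairwise_distance(points):
    """Average Euclidean distance over all unordered pairs (0.0 if fewer than two points)."""
    pairs = list(combinations(points, 2))
    if not pairs:
        return 0.0
    return sum(dist(a, b) for a, b in pairs) / len(pairs)


def intra_user_diversity(items):
    """How varied a single user's consumption is."""
    return mean_pairwise_distance(items)


def inter_user_diversity(consumption):
    """How different the average consumption is between users."""
    centroids = [mean_vector(items) for items in consumption.values()]
    return mean_pairwise_distance(centroids)


# Filter bubble: each user stays in a separate corner of the item space.
bubble = {"u1": [(0, 0), (0, 0)], "u2": [(1, 1), (1, 1)]}
# Homogenization: both users consume the same mix of items.
homogenized = {"u1": [(0, 0), (1, 1)], "u2": [(0, 0), (1, 1)]}
```

Under these assumed definitions, the `bubble` population has high inter-user diversity and zero intra-user diversity, while the `homogenized` population shows the reverse. Tracking the two numbers separately is what lets the two effects vary independently instead of being tied together by an assumed trade-off.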

INFORMS · Learning · MoDELS · Machine Learning · CASE ·

February 22, 2024

Privacy-Enhancing Collaborative Information Sharing through Federated Learning -- A Case of the Insurance Industry

Panyi Dong,Zhiyu Quan,Brandon Edwards,Shih-han Wang,Runhuan Feng,Tianyang Wang,Patrick Foley,Prashant Shah

The report demonstrates the benefits (in terms of improved claims loss modeling) of using Federated Learning (FL) to learn a single model across multiple insurance industry datasets without requiring the datasets themselves to be shared from one company to another. The application of FL addresses two of the most pressing concerns: limited data volume and limited data variety, which are caused by privacy concerns, the rarity of claim events, the lack of informative rating factors, etc. During each round of FL, collaborators compute improvements on the model using their local private data, and these insights are combined to update a global model. Such aggregation of insights allows for more effective forecasting of claims losses than models trained individually at each collaborator. Critically, this approach enables machine learning collaboration without the need for raw data to leave the compute infrastructure of each respective data owner. Additionally, the open-source framework used in our experiments, OpenFL, is designed so that it can be run using confidential computing as well as with additional algorithmic protections against leakage of information via the shared model updates. In this way, FL is implemented as a privacy-enhancing collaborative learning technique that addresses the challenges posed by the sensitivity and privacy of data in traditional machine learning solutions. This paper's application of FL can also be expanded to other areas, including fraud detection and catastrophe modeling, that have a similar need to incorporate data privacy into machine learning collaborations. Our framework and empirical results provide a foundation for future collaborations among insurers, regulators, academic researchers, and InsurTech experts.
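
The round structure described above (local training on private data, then aggregation into a global model) can be sketched as a federated-averaging loop. This is a minimal illustration under stated assumptions, not the report's implementation and not the OpenFL API: the model is a one-feature linear regression, aggregation is a sample-weighted mean of the collaborators' weights, and every name and number is hypothetical.

```python
def local_update(weights, data, lr=0.02, epochs=5):
    """One collaborator: a few gradient steps on mean squared error over its private data."""
    w, b = weights
    for _ in range(epochs):
        grad_w = sum(2 * (w * x + b - y) * x for x, y in data) / len(data)
        grad_b = sum(2 * (w * x + b - y) for x, y in data) / len(data)
        w, b = w - lr * grad_w, b - lr * grad_b
    return (w, b)


def federated_round(global_weights, datasets):
    """Each collaborator trains locally; only model weights are returned and averaged."""
    updates = [(local_update(global_weights, data), len(data)) for data in datasets]
    total = sum(n for _, n in updates)
    w = sum(weights[0] * n for weights, n in updates) / total
    b = sum(weights[1] * n for weights, n in updates) / total
    return (w, b)


def train(datasets, rounds=500):
    """Run several federated rounds starting from a zero-initialized global model."""
    weights = (0.0, 0.0)
    for _ in range(rounds):
        weights = federated_round(weights, datasets)
    return weights


# Two hypothetical insurers whose private data follow the same relation y = 2x + 1.
insurer_a = [(0.0, 1.0), (1.0, 3.0)]
insurer_b = [(2.0, 5.0), (3.0, 7.0), (4.0, 9.0)]
w, b = train([insurer_a, insurer_b])
```

The raw `(x, y)` pairs never leave `local_update`; the aggregator only sees weight tuples and sample counts. In this toy setting the global model approaches a slope of 2 and an intercept of 1 even though neither collaborator's data is shared.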

Controller · Learning · Reducible · Optimizer · Principle ·

February 21, 2024

Learning Optimal Control with Stochastic Models of Hamiltonian Dynamics

Chandrajit Bajaj,Minh Nguyen

from arxiv, 13 pages, 8 figures

Optimal control problems can be solved by applying the Pontryagin maximum principle and then solving for a Hamiltonian dynamical system. In this paper, we propose novel learning frameworks to tackle optimal control problems. By applying the Pontryagin maximum principle to the original optimal control problem, the learning focus shifts to reduced Hamiltonian dynamics and the corresponding adjoint variables. The reduced Hamiltonian networks can be learned by going backward in time and then minimizing a loss function deduced from the Pontryagin maximum principle's conditions. The learning process is further improved by progressively learning a posterior distribution of reduced Hamiltonians, utilizing a variational autoencoder, which leads to a more effective path exploration process. We apply our learning frameworks to control tasks and obtain competitive results.

Extensibility · Output · Lecture notes · Database · Algorithms and data structures ·

February 21, 2024

Polynomial Time Convergence of the Iterative Evaluation of Datalog° Programs

Sungjin Im,Benjamin Moseley,Hung Q. Ngo,Kirk Pruhs

Datalog° is an extension of Datalog that allows for aggregation and recursion over an arbitrary commutative semiring. Like Datalog, Datalog° programs can be evaluated via the natural iterative algorithm until a fixed point is reached. However, unlike Datalog, the natural iterative evaluation of some Datalog° programs over some semirings may not converge. It is known that the commutative semirings for which the iterative evaluation of Datalog° programs is guaranteed to converge are exactly those semirings that are stable [7]. Previously, the best known upper bound on the number of iterations until convergence over $p$-stable semirings is $\sum_{i=1}^n (p+2)^i = \Theta(p^n)$ steps, where $n$ is (essentially) the output size. We establish that, in fact, the natural iterative evaluation of a Datalog° program over a $p$-stable semiring converges within a polynomial number of iterations. In particular, our upper bound is $O( \sigma p n^2( n^2 \lg \lambda + \lg \sigma))$, where $\sigma$ is the number of elements in the semiring present in either the input databases or the Datalog° program, and $\lambda$ is the maximum number of terms in any product in the Datalog° program.
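
The "natural iterative algorithm" can be sketched for one concrete recursive rule. The example below is an assumption made for illustration, not taken from the paper: it evaluates the path rule T(x, y) = E(x, y) ⊕ ⊕_z T(x, z) ⊗ E(z, y) over a pluggable semiring, starting from the all-zero relation and stopping when an iteration changes nothing. The `Semiring` class and `naive_eval` function are hypothetical names.

```python
from dataclasses import dataclass
from typing import Any, Callable


@dataclass(frozen=True)
class Semiring:
    plus: Callable[[Any, Any], Any]   # aggregation, written as ⊕
    times: Callable[[Any, Any], Any]  # combination, written as ⊗
    zero: Any                         # identity for plus, annihilator for times


# (min, +) over non-negative weights: the rule computes shortest-path lengths.
TROPICAL = Semiring(plus=min, times=lambda a, b: a + b, zero=float("inf"))
# (or, and): the same rule computes plain reachability, as in ordinary Datalog.
BOOLEAN = Semiring(plus=lambda a, b: a or b, times=lambda a, b: a and b, zero=False)


def naive_eval(edges, nodes, sr, max_iters=1000):
    """Apply the rule repeatedly from T = zero; return (T, iterations) at the fixed point."""
    t = {(x, y): sr.zero for x in nodes for y in nodes}
    for iteration in range(1, max_iters + 1):
        new_t = {}
        for x in nodes:
            for y in nodes:
                value = edges.get((x, y), sr.zero)
                for z in nodes:
                    step = sr.times(t[(x, z)], edges.get((z, y), sr.zero))
                    value = sr.plus(value, step)
                new_t[(x, y)] = value
        if new_t == t:  # fixed point: one more application changes nothing
            return t, iteration
        t = new_t
    raise RuntimeError("no fixed point reached within max_iters")


nodes = ["a", "b", "c"]
weighted = {("a", "b"): 1, ("b", "c"): 2, ("a", "c"): 5}
shortest, iterations = naive_eval(weighted, nodes, TROPICAL)

reachable, _ = naive_eval({edge: True for edge in weighted}, nodes, BOOLEAN)
```

With non-negative weights the tropical iteration settles quickly, here improving the direct a-to-c cost of 5 to 3 via b. A graph with a negative-weight cycle is a simple case where the same loop would keep lowering values and never reach a fixed point, which is why the sketch carries a `max_iters` guard.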

Task-oriented dialogue systems · Automator · Performer · AIM · Comprehensibility ·

February 21, 2024

Are LLMs Effective Negotiators? Systematic Evaluation of the Multifaceted Capabilities of LLMs in Negotiation Dialogues

Deuksin Kwon,Emily Weiss,Tara Kulshrestha,Kushal Chawla,Gale M. Lucas,Jonathan Gratch

A successful negotiation demands a deep comprehension of the conversation context, Theory-of-Mind (ToM) skills to infer the partner's motives, as well as strategic reasoning and effective communication, making it challenging for automated systems. Given the remarkable performance of LLMs across a variety of NLP tasks, in this work, we aim to understand how LLMs can advance different aspects of negotiation research, ranging from designing dialogue systems to providing pedagogical feedback and scaling up data collection practices. To this end, we devise a methodology to analyze the multifaceted capabilities of LLMs across diverse dialogue scenarios covering all the time stages of a typical negotiation interaction. Our analysis adds to the increasing evidence for the superiority of GPT-4 across various tasks while also providing insights into specific tasks that remain difficult for LLMs. For instance, the models correlate poorly with human players when making subjective assessments about the negotiation dialogues and often struggle to generate responses that are contextually appropriate as well as strategically advantageous.

Estimation/Estimator · Robustness · GROUP · Maximum likelihood estimation · Loss ·

February 21, 2024

Robust Estimation of the Tail Index of a Single Parameter Pareto Distribution from Grouped Data

Chudamani Poudyal

from arxiv, 18 pages, 1 figure, 6 tables

Numerous robust estimators exist as alternatives to the maximum likelihood estimator (MLE) when a completely observed ground-up loss severity sample dataset is available. However, the options for robust alternatives to MLE become significantly limited when dealing with grouped loss severity data, with only a handful of methods available, such as least squares, minimum Hellinger distance, and optimal bounded influence function. This paper introduces a novel robust estimation technique, the Method of Truncated Moments (MTuM), specifically designed to estimate the tail index of a Pareto distribution from grouped data. The inferential justification of MTuM is established using the central limit theorem and validated through a comprehensive simulation study.

Operations · INFORMS · Mutually independent · Better · Diversity ·

February 16, 2024

Operational Collective Intelligence of Humans and Machines

Nikolos Gurney,Fred Morstatter,David V. Pynadath,Adam Russell,Gleb Satyukov

We explore the use of aggregative crowdsourced forecasting (ACF) as a mechanism to help operationalize "collective intelligence" of human-machine teams for coordinated actions. We adopt the definition for Collective Intelligence as: "A property of groups that emerges from synergies among data-information-knowledge, software-hardware, and individuals (those with new insights as well as recognized authorities) that enables just-in-time knowledge for better decisions than these three elements acting alone." Collective Intelligence emerges from new ways of connecting humans and AI to enable decision-advantage, in part by creating and leveraging additional sources of information that might otherwise not be included. ACF is a recent key advancement towards Collective Intelligence wherein predictions (X% probability that Y will happen) and rationales (why I believe it is this probability that Y will happen) are elicited independently from a diverse crowd, aggregated, and then used to inform higher-level decision-making. This research asks whether ACF, as a key way to enable Operational Collective Intelligence, could be brought to bear on operational scenarios (i.e., sequences of events with defined agents, components, and interactions) and decision-making, and considers whether such a capability could provide novel operational capabilities to enable new forms of decision-advantage.

MoDELS · Pivotal (company) · General intelligence · Language modeling · Multimodal ·

January 25, 2024

A Survey of Reasoning with Foundation Models

Jiankai Sun,Chuanyang Zheng,Enze Xie,Zhengying Liu,Ruihang Chu,Jianing Qiu,Jiaqi Xu,Mingyu Ding,Hongyang Li,Mengzhe Geng,Yue Wu,Wenhai Wang,Junsong Chen,Zhangyue Yin,Xiaozhe Ren,Jie Fu,Junxian He,Wu Yuan,Qi Liu,Xihui Liu,Yu Li,Hao Dong,Yu Cheng,Ming Zhang,Pheng Ann Heng,Jifeng Dai,Ping Luo,Jingdong Wang,Ji-Rong Wen,Xipeng Qiu,Yike Guo,Hui Xiong,Qun Liu,Zhenguo Li

from arxiv, 20 Figures, 160 Pages, 750+ References, Project Page //github.com/reasoning-survey/Awesome-Reasoning-Foundation-Models

Reasoning, a crucial ability for complex problem-solving, plays a pivotal role in various real-world settings such as negotiation, medical diagnosis, and criminal investigation. It serves as a fundamental methodology in the field of Artificial General Intelligence (AGI). With the ongoing development of foundation models, e.g., Large Language Models (LLMs), there is a growing interest in exploring their abilities in reasoning tasks. In this paper, we introduce seminal foundation models proposed or adaptable for reasoning, highlighting the latest advancements in various reasoning tasks, methods, and benchmarks. We then delve into the potential future directions behind the emergence of reasoning abilities within foundation models. We also discuss the relevance of multimodal learning, autonomous agents, and super alignment in the context of reasoning. By discussing these future research directions, we hope to inspire researchers in their exploration of this field, stimulate further advancements in reasoning with foundation models, and contribute to the development of AGI.