欧美狂野视频一区国产精品_色欲91精品国产免费观看_日韩一区二区三区四区黄色网站_日本中文字幕中出在线_高级精品久久久久精品三级_国产精品国产亚洲精品看不长_东京热一本之道加勒比久久

As Large Language Models (LLMs) gain great success in real-world applications, an increasing number of users are seeking to develop and deploy their customized LLMs through cloud services. Nonetheless, in some specific domains, there are still concerns regarding cost and trade-offs between privacy issues and accuracy. In this study, we introduce a cost-effective and self-adaptive LLM shaking tuning and recovery mechanism, named CypherTalk. With carefully designed horizontal and vertical shaking operators, we can achieve comparable accuracy results with SOTA privacy-preserving LLM schemes using Cryptography-based or Differential Privacy-based methods. Experiments also show that with the CypherTalk framework, users can achieve reliable accuracy when using optimized shaking operator settings. To our best knowledge, this is the first work that considers cost, and trade-off between model utility and privacy in LLM scenarios.

相關內容

大語言模型

關注 56

大語(yu)(yu)(yu)(yu)言模型是基于海量文(wen)(wen)本(ben)數(shu)(shu)據訓練的(de)(de)(de)(de)深(shen)(shen)度學(xue)習模型。它(ta)(ta)不(bu)(bu)僅能夠生(sheng)(sheng)成自(zi)然(ran)語(yu)(yu)(yu)(yu)言文(wen)(wen)本(ben)，還能夠深(shen)(shen)入理(li)解(jie)文(wen)(wen)本(ben)含(han)義，處理(li)各種自(zi)然(ran)語(yu)(yu)(yu)(yu)言任務，如文(wen)(wen)本(ben)摘要、問答、翻譯(yi)等(deng)。2023年(nian)，大語(yu)(yu)(yu)(yu)言模型及其(qi)在人工智能領域的(de)(de)(de)(de)應(ying)用已成為(wei)全球科技(ji)研(yan)究的(de)(de)(de)(de)熱點，其(qi)在規模上(shang)的(de)(de)(de)(de)增長(chang)尤為(wei)引人注目，參(can)數(shu)(shu)量已從最初的(de)(de)(de)(de)十(shi)幾(ji)億躍升(sheng)到如今的(de)(de)(de)(de)一萬億。參(can)數(shu)(shu)量的(de)(de)(de)(de)提升(sheng)使得模型能夠更(geng)加(jia)精(jing)細地(di)捕捉人類語(yu)(yu)(yu)(yu)言微妙(miao)之處，更(geng)加(jia)深(shen)(shen)入地(di)理(li)解(jie)人類語(yu)(yu)(yu)(yu)言的(de)(de)(de)(de)復雜性(xing)。在過去的(de)(de)(de)(de)一年(nian)里，大語(yu)(yu)(yu)(yu)言模型在吸納新知識、分解(jie)復雜任務以及圖(tu)文(wen)(wen)對齊等(deng)多方面都(dou)有(you)顯著提升(sheng)。隨著技(ji)術的(de)(de)(de)(de)不(bu)(bu)斷成熟，它(ta)(ta)將不(bu)(bu)斷拓展其(qi)應(ying)用范(fan)圍，為(wei)人類提供(gong)更(geng)加(jia)智能化(hua)和個性(xing)化(hua)的(de)(de)(de)(de)服(fu)務，進(jin)一步(bu)改善人們的(de)(de)(de)(de)生(sheng)(sheng)活(huo)和生(sheng)(sheng)產方式。

INFORMS · Learning · Analysis · GROUP · 圖 ·

2024 年 4 月 24 日

Human-in-the-loop Learning for Dynamic Congestion Games

Hongbo Li,Lingjie Duan

from arxiv, This paper has been accepted by IEEE Transactions on Mobile Computing (2024)

Today mobile users learn and share their traffic observations via crowdsourcing platforms (e.g., Waze). Yet such platforms simply cater to selfish users' myopic interests to recommend the shortest path, and do not encourage enough users to travel and learn other paths for future others. Prior studies focus on one-shot congestion games without considering users' information learning, while our work studies how users learn and alter traffic conditions on stochastic paths in a human-in-the-loop manner. Our analysis shows that the myopic routing policy leads to severe under-exploration of stochastic paths. This results in a price of anarchy (PoA) greater than $2$, as compared to the socially optimal policy in minimizing the long-term social cost. Besides, the myopic policy fails to ensure the correct learning convergence about users' traffic hazard beliefs. To address this, we focus on informational (non-monetary) mechanisms as they are easier to implement than pricing. We first show that existing information-hiding mechanisms and deterministic path-recommendation mechanisms in Bayesian persuasion literature do not work with even (\text{PoA}=\infty). Accordingly, we propose a new combined hiding and probabilistic recommendation (CHAR) mechanism to hide all information from a selected user group and provide state-dependent probabilistic recommendations to the other user group. Our CHAR successfully ensures PoA less than (\frac{5}{4}), which cannot be further reduced by any other informational (non-monetary) mechanism. Besides the parallel network, we further extend our analysis and CHAR to more general linear path graphs with multiple intermediate nodes, and we prove that the PoA results remain unchanged. Additionally, we carry out experiments with real-world datasets to further extend our routing graphs and verify the close-to-optimal performance of our CHAR.

泛化理論 · Performer · 特化 · Learning · MoDELS ·

2024 年 4 月 23 日

Cross-Domain Causal Preference Learning for Out-of-Distribution Recommendation

Zhuhang Li,Ning Yang

from arxiv, 16 pages, 5 figures, accepted by DASFAA2024

Recommender systems use users' historical interactions to learn their preferences and deliver personalized recommendations from a vast array of candidate items. Current recommender systems primarily rely on the assumption that the training and testing datasets have identical distributions, which may not hold true in reality. In fact, the distribution shift between training and testing datasets often occurs as a result of the evolution of user attributes, which degrades the performance of the conventional recommender systems because they fail in Out-of-Distribution (OOD) generalization, particularly in situations of data sparsity. This study delves deeply into the challenge of OOD generalization and proposes a novel model called Cross-Domain Causal Preference Learning for Out-of-Distribution Recommendation (CDCOR), which involves employing a domain adversarial network to uncover users' domain-shared preferences and utilizing a causal structure learner to capture causal invariance to deal with the OOD problem. Through extensive experiments on two real-world datasets, we validate the remarkable performance of our model in handling diverse scenarios of data sparsity and out-of-distribution environments. Furthermore, our approach surpasses the benchmark models, showcasing outstanding capabilities in out-of-distribution generalization.

大語言模型 · 語言模型化 · MoDELS · CASES · 數據集 ·

2024 年 4 月 23 日

A User-Centric Benchmark for Evaluating Large Language Models

Jiayin Wang,Fengran Mo,Weizhi Ma,Peijie Sun,Min Zhang,Jian-Yun Nie

Large Language Models (LLMs) are essential tools to collaborate with users on different tasks. Evaluating their performance to serve users' needs in real-world scenarios is important. While many benchmarks have been created, they mainly focus on specific predefined model abilities. Few have covered the intended utilization of LLMs by real users. To address this oversight, we propose benchmarking LLMs from a user perspective in both dataset construction and evaluation designs. We first collect 1846 real-world use cases with 15 LLMs from a user study with 712 participants from 23 countries. These self-reported cases form the User Reported Scenarios(URS) dataset with a categorization of 7 user intents. Secondly, on this authentic multi-cultural dataset, we benchmark 10 LLM services on their efficacy in satisfying user needs. Thirdly, we show that our benchmark scores align well with user-reported experience in LLM interactions across diverse intents, both of which emphasize the overlook of subjective scenarios. In conclusion, our study proposes to benchmark LLMs from a user-centric perspective, aiming to facilitate evaluations that better reflect real user needs. The benchmark dataset and code are available at //github.com/Alice1998/URS.

Twitter · Learning · surge · 運動行為分析 · 分解的 ·

2024 年 4 月 22 日

Exploring Platform Migration Patterns between Twitter and Mastodon: A User Behavior Study

Ujun Jeong,Paras Sheth,Anique Tahir,Faisal Alatawi,H. Russell Bernard,Huan Liu

from arxiv, Accepted to ICWSM 24 (Please cite it accordingly)

A recent surge of users migrating from Twitter to alternative platforms, such as Mastodon, raised questions regarding what migration patterns are, how different platforms impact user behaviors, and how migrated users settle in the migration process. In this study, we elaborate on how we investigate these questions by collecting data over 10,000 users who migrated from Twitter to Mastodon within the first ten weeks following the ownership change of Twitter. Our research is structured in three primary steps. First, we develop algorithms to extract and analyze migration patterns. Second, by leveraging behavioral analysis, we examine the distinct architectures of Twitter and Mastodon to learn how user behaviors correspond with the characteristics of each platform. Last, we determine how particular behavioral factors influence users to stay on Mastodon. We share our findings of user migration, insights, and lessons learned from the user behavior study.

INTERACT · VR · Performer · 設計 · TOOLS ·

2024 年 4 月 20 日

Two-In-One: A Design Space for Mapping Unimanual Input into Bimanual Interactions in VR for Users with Limited Movement

Momona Yamagami,Sasa Junuzovic,Mar Gonzalez-Franco,Eyal Ofek,Edward Cutrell,John R. Porter,Andrew D. Wilson,Martez E. Mott

from arxiv, 26 pages, 3 figures, 6 tables

Virtual Reality (VR) applications often require users to perform actions with two hands when performing tasks and interacting with objects in virtual environments. Although bimanual interactions in VR can resemble real-world interactions -- thus increasing realism and improving immersion -- they can also pose significant accessibility challenges to people with limited mobility, such as for people who have full use of only one hand. An opportunity exists to create accessible techniques that take advantage of users' abilities, but designers currently lack structured tools to consider alternative approaches. To begin filling this gap, we propose Two-in-One, a design space that facilitates the creation of accessible methods for bimanual interactions in VR from unimanual input. Our design space comprises two dimensions, bimanual interactions and computer assistance, and we provide a detailed examination of issues to consider when creating new unimanual input techniques that map to bimanual interactions in VR. We used our design space to create three interaction techniques that we subsequently implemented for a subset of bimanual interactions and received user feedback through a video elicitation study with 17 people with limited mobility. Our findings explore complex tradeoffs associated with autonomy and agency and highlight the need for additional settings and methods to make VR accessible to people with limited mobility.

XAI · MoDELS · AI · AIM · 可理解性 ·

2024 年 4 月 19 日

Improving User Mental Models of XAI Systems with Inclusive Design Approaches

Md Montaser Hamid,Fatima Moussaoui,Jimena Noa Guevara,Andrew Anderson,Margaret Burnett

Explainable Artificial Intelligence (XAI) systems aim to improve users' understanding of AI but rarely consider the inclusivity aspects of XAI. Without inclusive approaches, improving explanations might not work well for everyone. This study investigates leveraging users' diverse problem-solving styles as an inclusive strategy to fix an XAI prototype, with the ultimate goal of improving users' mental models of AI. We ran a between-subject study with 69 participants. Our results show that the inclusivity fixes increased participants' engagement with explanations and produced significantly improved mental models. Analyzing differences in mental model scores further highlighted specific inclusivity fixes that contributed to the significant improvement in the mental model.

詞元分析器 · MoDELS · Taxonomy · Seven · 可辨認的 ·

2024 年 4 月 19 日

Glitch Tokens in Large Language Models: Categorization Taxonomy and Effective Detection

Yuxi Li,Yi Liu,Gelei Deng,Ying Zhang,Wenjia Song,Ling Shi,Kailong Wang,Yuekang Li,Yang Liu,Haoyu Wang

With the expanding application of Large Language Models (LLMs) in various domains, it becomes imperative to comprehensively investigate their unforeseen behaviors and consequent outcomes. In this study, we introduce and systematically explore the phenomenon of "glitch tokens", which are anomalous tokens produced by established tokenizers and could potentially compromise the models' quality of response. Specifically, we experiment on seven top popular LLMs utilizing three distinct tokenizers and involving a totally of 182,517 tokens. We present categorizations of the identified glitch tokens and symptoms exhibited by LLMs when interacting with glitch tokens. Based on our observation that glitch tokens tend to cluster in the embedding space, we propose GlitchHunter, a novel iterative clustering-based technique, for efficient glitch token detection. The evaluation shows that our approach notably outperforms three baseline methods on eight open-source LLMs. To the best of our knowledge, we present the first comprehensive study on glitch tokens. Our new detection further provides valuable insights into mitigating tokenization-related errors in LLMs.

Analysis · 數據增強 · 數據集 · 可理解性 · HTTPS ·

2024 年 4 月 18 日

Aligning Actions and Walking to LLM-Generated Textual Descriptions

Radu Chivereanu,Adrian Cosma,Andy Catruna,Razvan Rughinis,Emilian Radoi

from arxiv, Accepted at 2nd Workshop on Learning with Few or without Annotated Face, Body and Gesture Data

Large Language Models (LLMs) have demonstrated remarkable capabilities in various domains, including data augmentation and synthetic data generation. This work explores the use of LLMs to generate rich textual descriptions for motion sequences, encompassing both actions and walking patterns. We leverage the expressive power of LLMs to align motion representations with high-level linguistic cues, addressing two distinct tasks: action recognition and retrieval of walking sequences based on appearance attributes. For action recognition, we employ LLMs to generate textual descriptions of actions in the BABEL-60 dataset, facilitating the alignment of motion sequences with linguistic representations. In the domain of gait analysis, we investigate the impact of appearance attributes on walking patterns by generating textual descriptions of motion sequences from the DenseGait dataset using LLMs. These descriptions capture subtle variations in walking styles influenced by factors such as clothing choices and footwear. Our approach demonstrates the potential of LLMs in augmenting structured motion attributes and aligning multi-modal representations. The findings contribute to the advancement of comprehensive motion understanding and open up new avenues for leveraging LLMs in multi-modal alignment and data augmentation for motion analysis. We make the code publicly available at //github.com/Radu1999/WalkAndText

contrastive · MoDELS · 對比學習 · Learning · INTERACT ·

2024 年 4 月 18 日

Knowledge-Aware Multi-Intent Contrastive Learning for Multi-Behavior Recommendation

Shunpan Liang,Junjie Zhao,Chen Li,Yu Lei

Multi-behavioral recommendation optimizes user experiences by providing users with more accurate choices based on their diverse behaviors, such as view, add to cart, and purchase. Current studies on multi-behavioral recommendation mainly explore the connections and differences between multi-behaviors from an implicit perspective. Specifically, they directly model those relations using black-box neural networks. In fact, users' interactions with items under different behaviors are driven by distinct intents. For instance, when users view products, they tend to pay greater attention to information such as ratings and brands. However, when it comes to the purchasing phase, users become more price-conscious. To tackle this challenge and data sparsity problem in the multi-behavioral recommendation, we propose a novel model: Knowledge-Aware Multi-Intent Contrastive Learning (KAMCL) model. This model uses relationships in the knowledge graph to construct intents, aiming to mine the connections between users' multi-behaviors from the perspective of intents to achieve more accurate recommendations. KAMCL is equipped with two contrastive learning schemes to alleviate the data scarcity problem and further enhance user representations. Extensive experiments on three real datasets demonstrate the superiority of our model.

2018 年 1 月 5 日

Deep Reinforcement Learning for List-wise Recommendations

Xiangyu Zhao,Liang Zhang,Zhuoye Ding,Dawei Yin,Yihong Zhao,Jiliang Tang

Recommender systems play a crucial role in mitigating the problem of information overload by suggesting users' personalized items or services. The vast majority of traditional recommender systems consider the recommendation procedure as a static process and make recommendations following a fixed strategy. In this paper, we propose a novel recommender system with the capability of continuously improving its strategies during the interactions with users. We model the sequential interactions between users and a recommender system as a Markov Decision Process (MDP) and leverage Reinforcement Learning (RL) to automatically learn the optimal strategies via recommending trial-and-error items and receiving reinforcements of these items from users' feedbacks. In particular, we introduce an online user-agent interacting environment simulator, which can pre-train and evaluate model parameters offline before applying the model online. Moreover, we validate the importance of list-wise recommendations during the interactions between users and agent, and develop a novel approach to incorporate them into the proposed framework LIRD for list-wide recommendations. The experimental results based on a real-world e-commerce dataset demonstrate the effectiveness of the proposed framework.