
This paper presents a Virtual Reality (VR) art therapy application called "Break Times", which aims to enhance students' mental well-being and foster creative expression. The application mimics art therapy sessions in a VR environment. A pilot user acceptance test with 10 participants showed a notable reduction in stress levels: 50% of participants reported normal stress levels post-intervention, compared to 20% pre-intervention. Participants praised the functionality and engagement features of "Break Times" and suggested improvements such as saving creations, incorporating 3D painting, and expanding the variety of artmaking scenes. The study highlights the potential of VR art therapy as an effective tool for stress management and emphasizes the need for continued refinement to maximize its therapeutic benefits.

Related content


The IEEE Virtual Reality conference has long been the premier international venue for presenting research results across the broad field of virtual reality (VR), including augmented reality (AR), mixed reality (MR), and 3D user interfaces, and it seeks high-quality original papers. Each paper should be classified as primarily covering research, applications, or systems, using the following guidelines: research papers should describe results that contribute to advances in software, hardware, algorithms, interaction, or human factors. Application papers should explain how the authors built on existing ideas and applied them to solve an interesting problem in a novel way. Each paper should include an evaluation of the success of the use of VR/AR/MR in the given application domain. Official website:

MoDELS · Language modeling · Processing (programming language) · Robustness · Performer

December 20, 2024

Language Models Resist Alignment: Evidence From Data Compression

Jiaming Ji,Kaile Wang,Tianyi Qiu,Boyuan Chen,Jiayi Zhou,Changye Li,Hantao Lou,Josef Dai,Yunhuai Liu,Yaodong Yang

from arxiv, The five-page version has been accepted by NeurIPS 2024 Workshop SoLaR. In the current version, we have conducted an in-depth expansion of both the theoretical and experimental aspects

Large language models (LLMs) may exhibit unintended or undesirable behaviors. Recent works have concentrated on aligning LLMs to mitigate harmful outputs. Despite these efforts, some anomalies indicate that even a well-conducted alignment process can be easily circumvented, whether intentionally or accidentally. Does alignment fine-tuning have robust effects on models, or are its impacts merely superficial? In this work, we make the first exploration of this phenomenon from both theoretical and empirical perspectives. Empirically, we demonstrate the elasticity of post-alignment models, i.e., the tendency to revert to the behavior distribution formed during the pre-training phase upon further fine-tuning. Leveraging compression theory, we formally deduce that fine-tuning disproportionately undermines alignment relative to pre-training, potentially by orders of magnitude. We validate the presence of elasticity through experiments on models of varying types and scales. Specifically, we find that model performance declines rapidly before reverting to the pre-training distribution, after which the rate of decline drops significantly. Furthermore, we reveal that elasticity positively correlates with increased model size and the expansion of pre-training data. Our findings underscore the need to address the inherent elasticity of LLMs to mitigate their resistance to alignment.

Scenario · state-of-the-art · Networks · Learning · Networking

December 19, 2024

Answer Set Networks: Casting Answer Set Programming into Deep Learning

Arseny Skryagin,Daniel Ochs,Phillip Deibert,Simon Kohaut,Devendra Singh Dhami,Kristian Kersting

from arxiv, 16 pages, 9 figures

Although Answer Set Programming (ASP) allows constraining neural-symbolic (NeSy) systems, its employment is hindered by the prohibitive costs of computing stable models and the CPU-bound nature of state-of-the-art solvers. To this end, we propose Answer Set Networks (ASN), a NeSy solver. Based on Graph Neural Networks (GNN), ASNs are a scalable approach to ASP-based Deep Probabilistic Logic Programming (DPPL). Specifically, we show how to translate ASPs into ASNs and demonstrate how ASNs can efficiently solve the encoded problem by leveraging GPU's batching and parallelization capabilities. Our experimental evaluations demonstrate that ASNs outperform state-of-the-art CPU-bound NeSy systems on multiple tasks. Simultaneously, we make the following two contributions based on the strengths of ASNs. Namely, we are the first to show the finetuning of Large Language Models (LLM) with DPPLs, employing ASNs to guide the training with logic. Further, we show the "constitutional navigation" of drones, i.e., encoding public aviation laws in an ASN for routing Unmanned Aerial Vehicles in uncertain environments.

Tokenizer · Optimizer · MoDELS · Language modeling · Performer

December 19, 2024

When Every Token Counts: Optimal Segmentation for Low-Resource Language Models

Bharath Raj S,Garvit Suri,Vikrant Dewangan,Raghav Sonavane

from arxiv, LoResLM @ COLING 2025

Traditional greedy tokenization methods have been a critical step in Natural Language Processing (NLP), influencing how text is converted into tokens and directly impacting model performance. While subword tokenizers like Byte-Pair Encoding (BPE) are widely used, questions remain about their optimality across model scales and languages. In this work, we demonstrate through extensive experiments that an optimal BPE configuration significantly reduces token count compared to greedy segmentation, yielding improvements in token-saving percentages and performance benefits, particularly for smaller models. We evaluate tokenization performance across various intrinsic and extrinsic tasks, including generation and classification. Our findings suggest that compression-optimized tokenization strategies could provide substantial advantages for multilingual and low-resource language applications, highlighting a promising direction for further research and inclusive NLP.
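The contrast between greedy and compression-optimal segmentation can be illustrated with a small sketch. The abstract does not specify the paper's algorithm, so the code below makes two assumptions: greedy segmentation is approximated by left-to-right longest match, and "optimal" means the fewest tokens for a fixed vocabulary, found by dynamic programming. The function names and the toy vocabulary are hypothetical.

```python
from typing import List, Set


def greedy_segment(word: str, vocab: Set[str]) -> List[str]:
    """Left-to-right longest-match segmentation (a stand-in for greedy tokenization)."""
    tokens: List[str] = []
    i = 0
    while i < len(word):
        # Try the longest candidate first; fall back to a single character.
        for j in range(len(word), i, -1):
            if word[i:j] in vocab or j == i + 1:
                tokens.append(word[i:j])
                i = j
                break
    return tokens


def optimal_segment(word: str, vocab: Set[str]) -> List[str]:
    """Fewest-token segmentation over the same vocabulary, via dynamic programming."""
    n = len(word)
    best = [0] + [n + 1] * n  # best[i] = minimum number of tokens covering word[:i]
    back = [0] * (n + 1)      # back[i] = start index of the last token in that cover
    for i in range(1, n + 1):
        for j in range(i):
            piece = word[j:i]
            # Single characters are always allowed, mirroring the greedy fallback.
            if (piece in vocab or i - j == 1) and best[j] + 1 < best[i]:
                best[i] = best[j] + 1
                back[i] = j
    tokens: List[str] = []
    i = n
    while i > 0:
        tokens.append(word[back[i]:i])
        i = back[i]
    return tokens[::-1]


if __name__ == "__main__":
    toy_vocab = {"ab", "abc", "cde"}  # hypothetical vocabulary
    print(greedy_segment("abcde", toy_vocab))
    print(optimal_segment("abcde", toy_vocab))
```

On this toy input the greedy pass commits to "abc" and is then forced into single characters, while the dynamic program finds a two-token cover. This is only an illustration of why the two strategies can diverge, not a reproduction of the paper's experiments.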

Directed · Vectorization · Controller · Disentangled · Layer

December 19, 2024

Sign-IDD: Iconicity Disentangled Diffusion for Sign Language Production

Shengeng Tang,Jiayi He,Dan Guo,Yanyan Wei,Feng Li,Richang Hong

from arxiv, Accepted by AAAI 2025

Sign Language Production (SLP) aims to generate semantically consistent sign videos from textual statements, where the conversion from textual glosses to sign poses (G2P) is a crucial step. Existing G2P methods typically treat sign poses as discrete three-dimensional coordinates and directly fit them, which overlooks the relative positional relationships among joints. To this end, we provide a new perspective, constraining joint associations and gesture details by modeling the limb bones to improve the accuracy and naturalness of the generated poses. In this work, we propose a pioneering iconicity disentangled diffusion framework, termed Sign-IDD, specifically designed for SLP. Sign-IDD incorporates a novel Iconicity Disentanglement (ID) module to bridge the gap between relative positions among joints. The ID module disentangles the conventional 3D joint representation into a 4D bone representation, comprising the 3D spatial direction vector and 1D spatial distance vector between adjacent joints. Additionally, an Attribute Controllable Diffusion (ACD) module is introduced to further constrain joint associations, in which the attribute separation layer aims to separate the bone direction and length attributes, and the attribute control layer is designed to guide the pose generation by leveraging the above attributes. The ACD module utilizes the gloss embeddings as semantic conditions and finally generates sign poses from noise embeddings. Extensive experiments on PHOENIX14T and USTC-CSL datasets validate the effectiveness of our method. The code is available at: //github.com/NaVi-start/Sign-IDD.
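The 4D bone representation described above (a 3D direction vector plus a 1D distance between adjacent joints) can be sketched as follows. This is a minimal illustration based only on the abstract, not the code from the Sign-IDD repository; the function name, the edge list, and the zero-length handling are assumptions.

```python
import math
from typing import List, Tuple

Vec3 = Tuple[float, float, float]
Bone = Tuple[float, float, float, float]


def joints_to_bones(joints: List[Vec3], edges: List[Tuple[int, int]]) -> List[Bone]:
    """Map each (parent, child) joint pair to (dx, dy, dz, length).

    (dx, dy, dz) is the unit direction from parent to child and
    length is the Euclidean distance between the two joints.
    """
    bones: List[Bone] = []
    for parent, child in edges:
        diff = [c - p for p, c in zip(joints[parent], joints[child])]
        length = math.sqrt(sum(d * d for d in diff))
        # Assumption: coincident joints get a zero direction to avoid dividing by zero.
        if length > 0:
            direction = [d / length for d in diff]
        else:
            direction = [0.0, 0.0, 0.0]
        bones.append((direction[0], direction[1], direction[2], length))
    return bones


if __name__ == "__main__":
    # Hypothetical three-joint chain (for example shoulder, elbow, wrist).
    joints = [(0.0, 0.0, 0.0), (3.0, 4.0, 0.0), (3.0, 4.0, 2.0)]
    edges = [(0, 1), (1, 2)]
    for bone in joints_to_bones(joints, edges):
        print(bone)
```

Separating direction from length in this way makes the relative position of adjacent joints explicit, which is the property the abstract says coordinate-only representations overlook.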

Automator · INTERACT · Diversity · motivation · Identifiable

December 18, 2024

GUI Agents: A Survey

Dang Nguyen,Jian Chen,Yu Wang,Gang Wu,Namyong Park,Zhengmian Hu,Hanjia Lyu,Junda Wu,Ryan Aponte,Yu Xia,Xintong Li,Jing Shi,Hongjie Chen,Viet Dac Lai,Zhouhang Xie,Sungchul Kim,Ruiyi Zhang,Tong Yu,Mehrab Tanjim,Nesreen K. Ahmed,Puneet Mathur,Seunghyun Yoon,Lina Yao,Branislav Kveton,Thien Huu Nguyen,Trung Bui,Tianyi Zhou,Ryan A. Rossi,Franck Dernoncourt

Graphical User Interface (GUI) agents, powered by Large Foundation Models, have emerged as a transformative approach to automating human-computer interaction. These agents autonomously interact with digital systems or software applications via GUIs, emulating human actions such as clicking, typing, and navigating visual elements across diverse platforms. Motivated by the growing interest and fundamental importance of GUI agents, we provide a comprehensive survey that categorizes their benchmarks, evaluation metrics, architectures, and training methods. We propose a unified framework that delineates their perception, reasoning, planning, and acting capabilities. Furthermore, we identify important open challenges and discuss key future directions. Finally, this work serves as a basis for practitioners and researchers to gain an intuitive understanding of current progress, techniques, benchmarks, and critical open problems that remain to be addressed.

MoDELS · Performer · HTTPS · Supervision · Compositionality

December 4, 2023

Data Management For Large Language Models: A Survey

Zige Wang,Wanjun Zhong,Yufei Wang,Qi Zhu,Fei Mi,Baojun Wang,Lifeng Shang,Xin Jiang,Qun Liu

from arxiv, Work in progress

Data plays a fundamental role in the training of Large Language Models (LLMs). Effective data management, particularly in the formulation of a well-suited training dataset, holds significance for enhancing model performance and improving training efficiency during pretraining and supervised fine-tuning phases. Despite the considerable importance of data management, the current research community still falls short in providing a systematic analysis of the rationale behind management strategy selection, its consequential effects, methodologies for evaluating curated datasets, and the ongoing pursuit of improved strategies. Consequently, the exploration of data management has attracted more and more attention among the research community. This survey provides a comprehensive overview of current research in data management within both the pretraining and supervised fine-tuning stages of LLMs, covering various noteworthy aspects of data management strategy design: data quantity, data quality, domain/task composition, etc. Looking toward the future, we extrapolate existing challenges and outline promising directions for development in this field. Therefore, this survey serves as a guiding resource for practitioners aspiring to construct powerful LLMs through effective data management practices. The collection of the latest papers is available at //github.com/ZigeW/data_management_LLM.

Visual question answering · Question answering · Extensibility · DATE · Dataset

November 19, 2021

Medical Visual Question Answering: A Survey

Zhihong Lin,Donghao Zhang,Qingyi Tac,Danli Shi,Gholamreza Haffari,Qi Wu,Mingguang He,Zongyuan Ge

Medical Visual Question Answering (VQA) is a combination of medical artificial intelligence and popular VQA challenges. Given a medical image and a clinically relevant question in natural language, the medical VQA system is expected to predict a plausible and convincing answer. Although general-domain VQA has been extensively studied, medical VQA still needs specific investigation and exploration due to its task features. In the first part of this survey, we cover and discuss the medical VQA datasets publicly available to date in terms of data source, data quantity, and task features. In the second part, we review the approaches used in medical VQA tasks. In the last part, we analyze some medical-specific challenges for the field and discuss future research directions.

Graph · Knowledge graph · Link prediction · Extensibility · entity

October 6, 2020

CoDEx: A Comprehensive Knowledge Graph Completion Benchmark

Tara Safavi,Danai Koutra

from arxiv, EMNLP 2020

We present CoDEx, a set of knowledge graph completion datasets extracted from Wikidata and Wikipedia that improve upon existing knowledge graph completion benchmarks in scope and level of difficulty. In terms of scope, CoDEx comprises three knowledge graphs varying in size and structure, multilingual descriptions of entities and relations, and tens of thousands of hard negative triples that are plausible but verified to be false. To characterize CoDEx, we contribute thorough empirical analyses and benchmarking experiments. First, we analyze each CoDEx dataset in terms of logical relation patterns. Next, we report baseline link prediction and triple classification results on CoDEx for five extensively tuned embedding models. Finally, we differentiate CoDEx from the popular FB15K-237 knowledge graph completion dataset by showing that CoDEx covers more diverse and interpretable content, and is a more difficult link prediction benchmark. Data, code, and pretrained models are available at //bit.ly/2EPbrJs.

Compositional GAN · INTERACT · MoDELS · Learned · entity

July 19, 2018

Compositional GAN: Learning Conditional Image Composition

Samaneh Azadi,Deepak Pathak,Sayna Ebrahimi,Trevor Darrell

Generative Adversarial Networks (GANs) can produce images of surprising complexity and realism, but are generally modeled to sample from a single latent source ignoring the explicit spatial interaction between multiple entities that could be present in a scene. Capturing such complex interactions between different objects in the world, including their relative scaling, spatial layout, occlusion, or viewpoint transformation is a challenging problem. In this work, we propose to model object composition in a GAN framework as a self-consistent composition-decomposition network. Our model is conditioned on the object images from their marginal distributions to generate a realistic image from their joint distribution by explicitly learning the possible interactions. We evaluate our model through qualitative experiments and user evaluations in both the scenarios when either paired or unpaired examples for the individual object images and the joint scenes are given during training. Our results reveal that the learned model captures potential interactions between the two object domains given as input to output new instances of composed scene at test time in a reasonable fashion.

FPGA · Convolutional neural network · Neural Networks · Convolution · Layer

September 30, 2016

Caffeinated FPGAs: FPGA Framework For Convolutional Neural Networks

Roberto DiCecco,Griffin Lacey,Jasmina Vasiljevic,Paul Chow,Graham Taylor,Shawki Areibi

Convolutional Neural Networks (CNNs) have gained significant traction in the field of machine learning, particularly due to their high accuracy in visual recognition. Recent works have pushed the performance of GPU implementations of CNNs to significantly improve their classification and training times. With these improvements, many frameworks have become available for implementing CNNs on both CPUs and GPUs, with no support for FPGA implementations. In this work we present a modified version of the popular CNN framework Caffe, with FPGA support. This allows for classification using CNN models and specialized FPGA implementations with the flexibility of reprogramming the device when necessary, seamless memory transactions between host and device, simple-to-use test benches, and the ability to create pipelined layer implementations. To validate the framework, we use the Xilinx SDAccel environment to implement an FPGA-based Winograd convolution engine and show that the FPGA layer can be used alongside other layers running on a host processor to run several popular CNNs (AlexNet, GoogleNet, VGG A, Overfeat). The results show that our framework achieves 50 GFLOPS across 3x3 convolutions in the benchmarks. This is achieved within a practical framework, which will aid in future development of FPGA-based CNNs.
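To give a sense of why a Winograd engine helps with 3x3 convolutions, the sketch below shows the standard one-dimensional Winograd F(2,3) transform, which produces two outputs of a 3-tap filter with four multiplications instead of six. It is a plain Python illustration of the general technique, not the paper's FPGA implementation; a 2D 3x3 engine is typically built by nesting this 1D form, and the function names here are hypothetical.

```python
from typing import List, Sequence


def direct_conv_f23(d: Sequence[float], g: Sequence[float]) -> List[float]:
    """Reference: two outputs of a 3-tap filter over four inputs (6 multiplications)."""
    return [sum(d[i + k] * g[k] for k in range(3)) for i in range(2)]


def winograd_f23(d: Sequence[float], g: Sequence[float]) -> List[float]:
    """Winograd F(2,3): the same two outputs using 4 multiplications.

    The filter-side terms depend only on g, so they can be precomputed
    once per filter and reused across all input tiles.
    """
    g_sum = (g[0] + g[1] + g[2]) / 2.0
    g_alt = (g[0] - g[1] + g[2]) / 2.0
    m1 = (d[0] - d[2]) * g[0]
    m2 = (d[1] + d[2]) * g_sum
    m3 = (d[2] - d[1]) * g_alt
    m4 = (d[1] - d[3]) * g[2]
    return [m1 + m2 + m3, m2 - m3 - m4]


if __name__ == "__main__":
    tile = [1.0, 2.0, 3.0, 4.0]   # hypothetical input tile
    taps = [1.0, 0.0, -1.0]       # hypothetical filter taps
    print(direct_conv_f23(tile, taps))
    print(winograd_f23(tile, taps))
```

Both functions return the same values for the same inputs; the saving comes from trading multiplications for additions, which is generally attractive on hardware where multipliers are the scarcer resource.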