
Learning · Robotics · Diversity · Engineering · Design

July 7, 2024

ClutterGen: A Cluttered Scene Generator for Robot Learning

Yinsen Jia, Boyuan Chen

We introduce ClutterGen, a physically compliant simulation scene generator capable of producing highly diverse, cluttered, and stable scenes for robot learning. Generating such scenes is challenging as each object must adhere to physical laws like gravity and collision. As the number of objects increases, finding valid poses becomes more difficult, necessitating significant human engineering effort, which limits the diversity of the scenes. To overcome these challenges, we propose a reinforcement learning method that can be trained with physics-based reward signals provided by the simulator. Our experiments demonstrate that ClutterGen can generate cluttered object layouts with up to ten objects on confined table surfaces. Additionally, our policy design explicitly encourages the diversity of the generated scenes for open-ended generation. Our real-world robot results show that ClutterGen can be directly used for clutter rearrangement and stable placement policy training.

Related content

Learning

MoDELS · Learning · INTERACT · Machine Learning · Subspace

August 22, 2024

Neural-ANOVA: Model Decomposition for Interpretable Machine Learning

Steffen Limmer, Steffen Udluft, Clemens Otte

from arxiv, 8 pages, 4 figures, 5 tables

The analysis of variance (ANOVA) decomposition offers a systematic method to understand the interaction effects that contribute to a specific decision output. In this paper we introduce Neural-ANOVA, an approach to decompose neural networks into glassbox models using the ANOVA decomposition. Our approach formulates a learning problem, which enables rapid and closed-form evaluation of integrals over subspaces that appear in the calculation of the ANOVA decomposition. Finally, we conduct numerical experiments to illustrate the advantages of enhanced interpretability and model validation by a decomposition of the learned interaction effects.
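
For readers unfamiliar with the decomposition the abstract refers to, the standard functional ANOVA decomposition can be written as follows. This is a sketch under the assumption that the inputs are independent and uniformly distributed on the unit hypercube $[0,1]^d$; the symbols $f$, $f_S$, and $d$ are notation chosen here and do not come from the abstract.

```latex
\begin{align*}
f(x) &= \sum_{S \subseteq \{1,\dots,d\}} f_S(x_S), \\
f_{\emptyset} &= \int_{[0,1]^d} f(x)\,dx, \\
f_S(x_S) &= \int_{[0,1]^{d-|S|}} f(x)\,dx_{-S} \;-\; \sum_{T \subsetneq S} f_T(x_T).
\end{align*}
```

Each term $f_S$ integrates $f$ over the variables outside $S$, which is the kind of subspace integral the abstract says the approach evaluates in closed form.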

Shaping · Signed distance · Microsoft Surface · Networking · Neural Networks

August 21, 2024

HYVE: Hybrid Vertex Encoder for Neural Distance Fields

Stefan Rhys Jeske, Jonathan Klein, Dominik L. Michels, Jan Bender

Neural shape representation generally refers to representing 3D geometry using neural networks, e.g., computing a signed distance or occupancy value at a specific spatial position. In this paper we present a neural-network architecture suitable for accurate encoding of 3D shapes in a single forward pass. Our architecture is based on a multi-scale hybrid system incorporating graph-based and voxel-based components, as well as a continuously differentiable decoder. The hybrid system includes a novel way of voxelizing point-based features in neural networks, which we show can be used in combination with oriented point-clouds to obtain smoother and more detailed reconstructions. Furthermore, our network is trained to solve the eikonal equation and only requires knowledge of the zero-level set for training and inference. This means that in contrast to most previous shape encoder architectures, our network is able to output valid signed distance fields without explicit prior knowledge of non-zero distance values or shape occupancy. It also requires only a single forward pass, instead of the latent-code optimization used in auto-decoder methods. We further propose a modification to the loss function for cases where surface normals are not well defined, e.g., in the context of non-watertight surfaces and non-manifold geometry, resulting in an unsigned distance field. Overall, our system can help to reduce the computational overhead of training and evaluating neural distance fields, as well as enabling the application to difficult geometry.
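
The eikonal equation mentioned above states that a valid signed distance field has a gradient of unit norm almost everywhere. The following is a minimal sketch of that property, not the paper's training loss: it uses an analytic sphere instead of a neural network and finite differences instead of automatic differentiation. The names `sphere_sdf`, `gradient_norm`, and `eikonal_residual` are hypothetical helpers introduced here.

```python
import math
import random


def sphere_sdf(p, radius=1.0):
    """Exact signed distance from point p to a sphere centred at the origin."""
    return math.sqrt(sum(c * c for c in p)) - radius


def gradient_norm(sdf, p, eps=1e-4):
    """Norm of the gradient of sdf at p, estimated with central differences."""
    grad = []
    for axis in range(len(p)):
        plus = list(p)
        minus = list(p)
        plus[axis] += eps
        minus[axis] -= eps
        grad.append((sdf(plus) - sdf(minus)) / (2 * eps))
    return math.sqrt(sum(g * g for g in grad))


def eikonal_residual(sdf, points):
    """Mean of (||grad f|| - 1)^2 over the sample points."""
    return sum((gradient_norm(sdf, p) - 1.0) ** 2 for p in points) / len(points)


# Sample points in a cube, skipping a small ball around the origin where
# the distance function is not differentiable.
random.seed(0)
points = []
while len(points) < 500:
    candidate = [random.uniform(-2.0, 2.0) for _ in range(3)]
    if math.sqrt(sum(c * c for c in candidate)) > 0.1:
        points.append(candidate)

# A true distance field has a residual near zero; a rescaled field does not.
residual_true = eikonal_residual(sphere_sdf, points)
residual_scaled = eikonal_residual(lambda p: 2.0 * sphere_sdf(p), points)
```

Doubling the field leaves the zero-level set unchanged but breaks the unit-gradient property, which is why an eikonal term can constrain a network that is supervised only on the surface.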

Multimodal · Scenario · Extensibility · Integration · Reviewer

August 21, 2024

MMFakeBench: A Mixed-Source Multimodal Misinformation Detection Benchmark for LVLMs

Xuannan Liu, Zekun Li, Peipei Li, Shuhan Xia, Xing Cui, Linzhi Huang, Huaibo Huang, Weihong Deng, Zhaofeng He

from arxiv, Project page: //liuxuannan.github.io/MMFakeBench.github.io/

Current multimodal misinformation detection (MMD) methods often assume a single source and type of forgery for each sample, which is insufficient for real-world scenarios where multiple forgery sources coexist. The lack of a benchmark for mixed-source misinformation has hindered progress in this field. To address this, we introduce MMFakeBench, the first comprehensive benchmark for mixed-source MMD. MMFakeBench includes 3 critical sources: textual veracity distortion, visual veracity distortion, and cross-modal consistency distortion, along with 12 sub-categories of misinformation forgery types. We further conduct an extensive evaluation of 6 prevalent detection methods and 15 large vision-language models (LVLMs) on MMFakeBench under a zero-shot setting. The results indicate that current methods struggle under this challenging and realistic mixed-source MMD setting. Additionally, we propose an innovative unified framework, which integrates rationales, actions, and tool-use capabilities of LVLM agents, significantly enhancing accuracy and generalization. We believe this study will catalyze future research into more realistic mixed-source multimodal misinformation and provide a fair evaluation of misinformation detection methods.

Networking · MoDELS · CTR · INTERACT · Next

August 9, 2024

DCNv3: Towards Next Generation Deep Cross Network for CTR Prediction

Honghao Li, Yiwen Zhang, Yi Zhang, Hanwei Li, Lei Sang, Jieming Zhu

Deep & Cross Network and its derivative models have become an important paradigm for click-through rate (CTR) prediction due to their effective balance between computational cost and performance. However, these models face four major limitations: (1) the performance of existing explicit feature interaction methods is often weaker than that of implicit deep neural networks (DNNs), undermining their necessity; (2) many models fail to adaptively filter noise while increasing the order of feature interactions; (3) the fusion methods of most models cannot provide suitable supervision signals for their different sub-networks; (4) while most models claim to capture high-order feature interactions, they often do so implicitly and non-interpretably through DNNs, which limits the trustworthiness of the model's predictions. To address the identified limitations, this paper proposes the next generation deep cross network: Deep Cross Network v3 (DCNv3), along with its two sub-networks: Linear Cross Network (LCN) and Exponential Cross Network (ECN) for CTR prediction. DCNv3 ensures interpretability in feature interaction modeling while linearly and exponentially increasing the order of feature interactions to achieve genuine Deep Crossing rather than just Deep & Cross. Additionally, we employ a Self-Mask operation to filter noise and reduce the number of parameters in the Cross Network by half. In the fusion layer, we use a simple yet effective multi-loss trade-off and calculation method, called Tri-BCE, to provide appropriate supervision signals. Comprehensive experiments on six datasets demonstrate the effectiveness, efficiency, and interpretability of DCNv3. The code, running logs, and detailed hyperparameter configurations are available at: //github.com/salmon1802/DCNv3.

Graph · Generalization theory · Performer · MoDELS · Biased

August 8, 2024

DIVE: Subgraph Disagreement for Graph Out-of-Distribution Generalization

Xin Sun, Liang Wang, Qiang Liu, Shu Wu, Zilei Wang, Liang Wang

This paper addresses the challenge of out-of-distribution (OOD) generalization in graph machine learning, a field rapidly advancing yet grappling with the discrepancy between source and target data distributions. Traditional graph learning algorithms, based on the assumption of uniform distribution between training and test data, falter in real-world scenarios where this assumption fails, resulting in suboptimal performance. A principal factor contributing to this suboptimal performance is the inherent simplicity bias of neural networks trained through Stochastic Gradient Descent (SGD), which prefer simpler features over more complex yet equally or more predictive ones. This bias leads to a reliance on spurious correlations, adversely affecting OOD performance in various tasks such as image recognition, natural language understanding, and graph classification. Current methodologies, including subgraph-mixup and information bottleneck approaches, have achieved partial success but struggle to overcome simplicity bias, often reinforcing spurious correlations. To tackle this, we propose DIVE, training a collection of models to focus on all label-predictive subgraphs by encouraging the models to foster divergence on the subgraph mask, which circumvents the limitation of a model solely focusing on the subgraph corresponding to simple structural patterns. Specifically, we employ a regularizer to punish overlap in extracted subgraphs across models, thereby encouraging different models to concentrate on distinct structural patterns. Model selection for robust OOD performance is achieved through validation accuracy. Tested across four datasets from the GOOD benchmark and one dataset from the DrugOOD benchmark, our approach demonstrates significant improvement over existing methods, effectively addressing the simplicity bias and enhancing generalization in graph machine learning.
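
The abstract does not give the form of the overlap regularizer. One plausible reading, sketched here purely as an assumption, is to average the elementwise product of soft edge masks over every pair of models. The function `overlap_penalty` and the toy masks are hypothetical and are not taken from the paper.

```python
from itertools import combinations


def overlap_penalty(masks):
    """Average pairwise overlap between soft edge masks with values in [0, 1].

    masks: one list of per-edge scores per model, all of the same length.
    """
    pairs = list(combinations(masks, 2))
    if not pairs:
        return 0.0
    total = 0.0
    for mask_a, mask_b in pairs:
        # The mean elementwise product is large only where both models
        # select the same edges.
        total += sum(a * b for a, b in zip(mask_a, mask_b)) / len(mask_a)
    return total / len(pairs)


# Two models selecting disjoint edges versus two models selecting the same edges.
disjoint = [[1.0, 1.0, 0.0, 0.0], [0.0, 0.0, 1.0, 1.0]]
identical = [[1.0, 1.0, 0.0, 0.0], [1.0, 1.0, 0.0, 0.0]]

penalty_disjoint = overlap_penalty(disjoint)
penalty_identical = overlap_penalty(identical)
```

Added to each model's task loss, a term of this kind is smallest when the models attend to different subgraphs, which matches the stated goal of pushing them toward distinct structural patterns.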

Automator · Performer · Design · Optimizer · Transformation

August 6, 2024

INSIGHT: Universal Neural Simulator for Analog Circuits Harnessing Autoregressive Transformers

Souradip Poddar, Youngmin Oh, Yao Lai, Hanqing Zhu, Bosun Hwang, David Z. Pan

Analog front-end design heavily relies on specialized human expertise and costly trial-and-error simulations, which motivated many prior works on analog design automation. However, efficient and effective exploration of the vast and complex design space remains constrained by the time-consuming nature of SPICE simulations, making effective design automation a challenging endeavor. In this paper, we introduce INSIGHT, a GPU-powered, technology-agnostic, effective universal neural simulator in the analog front-end design automation loop. INSIGHT accurately predicts the performance metrics of analog circuits across various technologies with just a few microseconds of inference time. Notably, its autoregressive capabilities enable INSIGHT to accurately predict simulation-costly critical transient specifications leveraging less expensive performance metric information. Its low cost and high fidelity make INSIGHT a good substitute for standard simulators in analog front-end optimization frameworks. INSIGHT is compatible with any optimization framework, facilitating enhanced design space exploration for sample efficiency through sophisticated offline learning and adaptation techniques. Our experiments demonstrate that INSIGHT-M, a model-based batch reinforcement learning sizing framework with INSIGHT as the accurate surrogate, requires fewer than 20 real-time simulations with 100-1000x lower simulation costs and significant speedup over existing sizing methods.

Performer · Large language models · Reducible · Continuity · Locally linear embedding

August 6, 2024

HARMONIC: Harnessing LLMs for Tabular Data Synthesis and Privacy Protection

Yuxin Wang, Duanyu Feng, Yongfu Dai, Zhengyu Chen, Jimin Huang, Sophia Ananiadou, Qianqian Xie, Hao Wang

Data serves as the foundation for advancing deep learning, particularly tabular data presented in a structured format, which is highly conducive to modeling. However, even in the era of LLMs, obtaining tabular data from sensitive domains remains a challenge due to privacy or copyright concerns. Hence, exploring how to effectively use models like LLMs to generate realistic and privacy-preserving synthetic tabular data is urgent. In this paper, we take a step forward to explore LLMs for tabular data synthesis and privacy protection, by introducing a new framework, HARMONIC, for tabular data generation and evaluation. In the tabular data generation of our framework, unlike previous small-scale LLM-based methods that rely on continued pre-training, we explore larger-scale LLMs with fine-tuning to generate tabular data and enhance privacy. Based on the idea of the k-nearest neighbors algorithm, an instruction fine-tuning dataset is constructed to inspire LLMs to discover inter-row relationships. Then, with fine-tuning, LLMs are trained to remember the format and connections of the data rather than the data itself, which reduces the risk of privacy leakage. In the evaluation part of our framework, we develop specific privacy risk metrics DLT for LLM synthetic data generation, as well as performance evaluation metrics LLE for downstream LLM tasks. Our experiments find that this tabular data generation framework achieves equivalent performance to existing methods with better privacy, which also demonstrates our evaluation framework for assessing the effectiveness of synthetic data and privacy risks in LLM scenarios.

MoDELS · CASES · Integration · Language modeling · Large language models

August 5, 2024

RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation

Daniel Fleischer, Moshe Berchansky, Moshe Wasserblat, Peter Izsak

from arxiv, 10 pages

Implementing Retrieval-Augmented Generation (RAG) systems is inherently complex, requiring deep understanding of data, use cases, and intricate design decisions. Additionally, evaluating these systems presents significant challenges, necessitating assessment of both retrieval accuracy and generative quality through a multi-faceted approach. We introduce RAG Foundry, an open-source framework for augmenting large language models for RAG use cases. RAG Foundry integrates data creation, training, inference and evaluation into a single workflow, facilitating the creation of data-augmented datasets for training and evaluating large language models in RAG settings. This integration enables rapid prototyping and experimentation with various RAG techniques, allowing users to easily generate datasets and train RAG models using internal or specialized knowledge sources. We demonstrate the framework's effectiveness by augmenting and fine-tuning Llama-3 and Phi-3 models with diverse RAG configurations, showcasing consistent improvements across three knowledge-intensive datasets. Code is released as open-source at //github.com/IntelLabs/RAGFoundry.

Large language models · Comprehensibility · MoDELS · Identifiable · Extensibility

August 3, 2024

MarkLLM: An Open-Source Toolkit for LLM Watermarking

Leyi Pan, Aiwei Liu, Zhiwei He, Zitian Gao, Xuandong Zhao, Yijian Lu, Binglin Zhou, Shuliang Liu, Xuming Hu, Lijie Wen, Irwin King, Philip S. Yu

from arxiv, 17 pages, 5 figures, 6 tables

LLM watermarking, which embeds imperceptible yet algorithmically detectable signals in model outputs to identify LLM-generated text, has become crucial in mitigating the potential misuse of large language models. However, the abundance of LLM watermarking algorithms, their intricate mechanisms, and the complex evaluation procedures and perspectives pose challenges for researchers and the community to easily experiment with, understand, and assess the latest advancements. To address these issues, we introduce MarkLLM, an open-source toolkit for LLM watermarking. MarkLLM offers a unified and extensible framework for implementing LLM watermarking algorithms, while providing user-friendly interfaces to ensure ease of access. Furthermore, it enhances understanding by supporting automatic visualization of the underlying mechanisms of these algorithms. For evaluation, MarkLLM offers a comprehensive suite of 12 tools spanning three perspectives, along with two types of automated evaluation pipelines. Through MarkLLM, we aim to support researchers while improving the comprehension and involvement of the general public in LLM watermarking technology, fostering consensus and driving further advancements in research and application. Our code is available at //github.com/THU-BPM/MarkLLM.
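
To make "algorithmically detectable signals" concrete, here is a toy sketch of one well-known family of schemes, in which a hash of the previous token marks part of the vocabulary as "green" and detection counts green tokens with a z-test. It is an illustration under simplifying assumptions and does not use or reproduce the MarkLLM API. The names `is_green`, `detection_z_score`, and `generate_watermarked` are hypothetical, and the "generator" simply picks a green token rather than sampling from a language model.

```python
import hashlib
import math


def is_green(prev_token, token, gamma=0.5):
    """Pseudo-randomly assign `token` to the green list, seeded by `prev_token`."""
    digest = hashlib.sha256(f"{prev_token}|{token}".encode("utf-8")).digest()
    return digest[0] / 256.0 < gamma


def detection_z_score(tokens, gamma=0.5):
    """z-score of the green-token count against the no-watermark expectation."""
    scored = len(tokens) - 1  # the first token has no predecessor
    if scored < 1:
        raise ValueError("need at least two tokens")
    green = sum(is_green(p, t, gamma) for p, t in zip(tokens, tokens[1:]))
    return (green - gamma * scored) / math.sqrt(scored * gamma * (1 - gamma))


def generate_watermarked(vocab, length, start, gamma=0.5):
    """Toy generator: always emit the first green token for the current context."""
    tokens = [start]
    for _ in range(length):
        prev = tokens[-1]
        green_choices = [t for t in vocab if is_green(prev, t, gamma)]
        tokens.append(green_choices[0] if green_choices else vocab[0])
    return tokens


vocab = [f"tok{i}" for i in range(50)]
marked = generate_watermarked(vocab, 100, "tok0")
z_marked = detection_z_score(marked)
```

Without a watermark, roughly a fraction `gamma` of tokens would be green by chance, so a large positive z-score is evidence that the text was generated with the green-list bias.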

state-of-the-art · Comprehensibility · BERT · Denoising autoencoder · Performer

June 19, 2019

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Zhilin Yang, Zihang Dai, Yiming Yang, Jaime Carbonell, Ruslan Salakhutdinov, Quoc V. Le

from arxiv, Pretrained models and code are available at //github.com/zihangdai/xlnet

With the capability of modeling bidirectional contexts, denoising autoencoding based pretraining like BERT achieves better performance than pretraining approaches based on autoregressive language modeling. However, relying on corrupting the input with masks, BERT neglects dependency between the masked positions and suffers from a pretrain-finetune discrepancy. In light of these pros and cons, we propose XLNet, a generalized autoregressive pretraining method that (1) enables learning bidirectional contexts by maximizing the expected likelihood over all permutations of the factorization order and (2) overcomes the limitations of BERT thanks to its autoregressive formulation. Furthermore, XLNet integrates ideas from Transformer-XL, the state-of-the-art autoregressive model, into pretraining. Empirically, XLNet outperforms BERT on 20 tasks, often by a large margin, and achieves state-of-the-art results on 18 tasks including question answering, natural language inference, sentiment analysis, and document ranking.
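
The permutation-based objective in point (1) is commonly written as below. The notation is added here for illustration and does not appear in the abstract: $\mathcal{Z}_T$ is the set of all permutations of $\{1,\dots,T\}$, $z_t$ is the $t$-th element of a permutation $z$, and $z_{<t}$ denotes its first $t-1$ elements.

```latex
\max_{\theta}\;
\mathbb{E}_{z \sim \mathcal{Z}_T}
\left[ \sum_{t=1}^{T} \log p_{\theta}\!\left(x_{z_t} \mid x_{z_{<t}}\right) \right]
```

Because every position can appear after any other position in some factorization order, the model learns to use context from both sides while each individual term remains autoregressive.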


