Although Large Language Models (LLMs) have demonstrated remarkable code-generation ability, they still struggle with complex tasks. In real-world software development, humans usually tackle complex tasks through collaborative teamwork, a strategy that significantly controls development complexity and enhances software quality. Inspired by this, we present a self-collaboration framework for code generation employing LLMs, exemplified by ChatGPT. Specifically, through role instructions, 1) multiple LLM agents act as distinct "experts", each responsible for a specific subtask within a complex task; 2) the agents are instructed how to collaborate and interact, so that different roles form a virtual team that facilitates each other's work; ultimately, the virtual team addresses code-generation tasks collaboratively without the need for human intervention. To effectively organize and manage this virtual team, we incorporate software-development methodology into the framework. Thus, we assemble an elementary team consisting of three LLM roles (i.e., analyst, coder, and tester) responsible for the analysis, coding, and testing stages of software development. We conduct comprehensive experiments on various code-generation benchmarks. Experimental results indicate that self-collaboration code generation improves Pass@1 by 29.9%-47.1% relative to the base LLM agent. Moreover, we show that self-collaboration could potentially enable LLMs to efficiently handle complex repository-level tasks that are not readily solved by a single LLM agent.
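
To make the division of labor concrete, the following is a minimal sketch of the analyst-coder-tester loop, assuming a generic chat-completion helper `call_llm(messages)` (a hypothetical placeholder for any chat-capable LLM client); the role instructions shown are illustrative, not the paper's exact prompts.

```python
# Minimal sketch of role-based self-collaboration. `call_llm` is a
# hypothetical placeholder; plug in any chat-capable LLM client.

ROLES = {
    "analyst": "You are a requirements analyst. Decompose the task into a concise plan.",
    "coder":   "You are a coder. Implement the plan as a single Python function.",
    "tester":  "You are a tester. Review the code, report defects, or reply 'PASS'.",
}

def call_llm(messages):
    """Placeholder for a chat-completion call; returns the assistant's reply as a string."""
    raise NotImplementedError("plug in an LLM client here")

def self_collaborate(task, max_rounds=3):
    # Analyst drafts a plan, coder implements it, tester reviews; the loop
    # repeats until the tester is satisfied or the round budget is exhausted.
    plan = call_llm([{"role": "system", "content": ROLES["analyst"]},
                     {"role": "user", "content": task}])
    code = call_llm([{"role": "system", "content": ROLES["coder"]},
                     {"role": "user", "content": f"Task:\n{task}\n\nPlan:\n{plan}"}])
    for _ in range(max_rounds):
        report = call_llm([{"role": "system", "content": ROLES["tester"]},
                           {"role": "user", "content": f"Task:\n{task}\n\nCode:\n{code}"}])
        if report.strip().upper().startswith("PASS"):
            break  # the virtual team converged without human intervention
        code = call_llm([{"role": "system", "content": ROLES["coder"]},
                         {"role": "user", "content": f"Revise the code.\nTask:\n{task}\n"
                                                     f"Code:\n{code}\nTest report:\n{report}"}])
    return code
```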

Related Content

Recent advancements in Artificial Intelligence (AI) and machine learning have demonstrated transformative capabilities across diverse domains. This progress extends to the field of patent analysis and innovation, where AI-based tools present opportunities to streamline and enhance important tasks in the patent life cycle, such as classification, retrieval, and valuation prediction. This not only improves the efficiency of patent researchers and applicants but also opens new avenues for technological innovation and discovery. Our survey provides a comprehensive summary of recent AI tools in patent analysis, covering more than 40 papers from 26 venues between 2017 and 2023. Unlike existing surveys, we include methods that work on both patent image and text data. Furthermore, we introduce a novel taxonomy that categorizes these methods by the tasks in the patent life cycle as well as the specifics of the AI methods. This interdisciplinary survey aims to serve as a resource for researchers and practitioners working at the intersection of AI and patent analysis, as well as for patent offices aiming to build efficient patent systems.

Neural Radiance Fields (NeRF) introduced a novel way to represent scenes, allowing high-quality 3D reconstruction from 2D images. Following its remarkable achievements, global localization within NeRF maps has become an essential task for enabling a wide range of applications. Recently, Loc-NeRF demonstrated a localization approach that combines traditional Monte Carlo Localization with NeRF, showing promising results for using NeRF as an environment map. However, despite its advancements, Loc-NeRF suffers from a time-intensive ray rendering process, which can be a significant limitation in practical applications. To address this issue, we introduce Fast Loc-NeRF, which leverages a coarse-to-fine approach to enable more efficient and accurate NeRF map-based global localization. Specifically, Fast Loc-NeRF matches rendered pixels against observed images at multiple resolutions, progressing from low to high resolution. As a result, it speeds up the costly particle update process while maintaining precise localization results. Additionally, to reject abnormal particles, we propose particle rejection weighting, which estimates the uncertainty of particles by exploiting NeRF's characteristics and accounts for it in the particle weighting process. Fast Loc-NeRF sets new state-of-the-art localization performance on several benchmarks, demonstrating its accuracy and efficiency.
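
As a rough illustration of the coarse-to-fine particle update and the rejection weighting, the sketch below assumes hypothetical `render_pixels(pose, resolution)` and `render_uncertainty(pose, resolution)` helpers standing in for NeRF rendering and its uncertainty estimate; the actual matching and weighting rules in Fast Loc-NeRF may differ.

```python
import numpy as np

def coarse_to_fine_update(particles, weights, observed_pyramid,
                          render_pixels, render_uncertainty,
                          resolutions=(8, 16, 32)):
    """particles: array of candidate poses; observed_pyramid: observations at each resolution."""
    for res, observed in zip(resolutions, observed_pyramid):
        new_weights = np.empty_like(weights)
        for i, pose in enumerate(particles):
            rendered = render_pixels(pose, res)          # cheap at coarse resolution
            error = np.mean((rendered - observed) ** 2)  # photometric mismatch
            sigma = render_uncertainty(pose, res)        # NeRF-based uncertainty
            likelihood = np.exp(-error)
            rejection = 1.0 / (1.0 + sigma)              # down-weight abnormal particles
            new_weights[i] = weights[i] * likelihood * rejection
        weights = new_weights / new_weights.sum()
        # Keep only the most plausible particles before refining at the next resolution
        keep = np.argsort(weights)[-max(1, len(particles) // 2):]
        particles, weights = particles[keep], weights[keep]
        weights = weights / weights.sum()
    return particles, weights
```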

The Minimum Covariance Determinant (MCD) method is a widely adopted tool for robust estimation and outlier detection. In this paper, we introduce a new framework for model selection in MCD with spectral embedding based on the notion of stability. Our best subset algorithm leverages principal component analysis for dimension reduction, statistical depths for effective initialization, and concentration steps for subset refinement. Subsequently, we construct a bootstrap procedure to estimate the instability of the best subset algorithm. The parameter combination exhibiting minimal instability proves ideal for the purposes of high-dimensional outlier detection, while the instability path offers insights into the inlier/outlier structure. We rigorously benchmark the proposed framework against existing MCD variants and illustrate its practical utility on two spectra data sets and a cancer genomics data set.
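
The stability criterion can be pictured schematically: run a best-subset routine on bootstrap resamples and measure how much the selected inlier subsets disagree. In the sketch below, `best_subset(X, h)` is a hypothetical stand-in for the PCA-, depth-, and concentration-step-based algorithm described above; any MCD-style subset selector with subset size h fits the same interface.

```python
import numpy as np

def instability(X, h, best_subset, n_boot=20, seed=0):
    """Estimate instability of a best-subset selector via bootstrap resampling."""
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    masks = []
    for _ in range(n_boot):
        idx = rng.choice(n, size=n, replace=True)   # bootstrap resample
        chosen = best_subset(X[idx], h)             # indices into the resample
        mask = np.zeros(n, dtype=bool)
        mask[np.unique(idx[chosen])] = True         # map selection back to original points
        masks.append(mask)
    # Average pairwise disagreement between the selected inlier subsets
    disagreements = [np.mean(a != b) for i, a in enumerate(masks) for b in masks[i + 1:]]
    return float(np.mean(disagreements))
```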

While recent image warping approaches have achieved remarkable success on existing benchmarks, they still require training separate models for each specific task and cannot generalize well to different camera models or customized manipulations. To address the diverse types of warping encountered in practice, we propose a Multiple-in-One image WArping model (named MOWA) in this work. Specifically, we mitigate the difficulty of multi-task learning by disentangling motion estimation at both the region level and the pixel level. To further enable dynamic task-aware image warping, we introduce a lightweight point-based classifier that predicts the task type, serving as prompts to modulate the feature maps for more accurate estimation. To our knowledge, this is the first work that solves multiple practical warping tasks in a single model. Extensive experiments demonstrate that MOWA, trained on six tasks for multiple-in-one image warping, outperforms state-of-the-art task-specific models across most tasks. Moreover, MOWA exhibits promising potential to generalize to unseen scenes, as evidenced by cross-domain and zero-shot evaluations. The code and more visual results can be found on the project page: //kangliao929.github.io/projects/mowa/.
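
One hedged reading of "prompts that modulate the feature maps" is a FiLM-style modulation driven by the predicted task type, sketched below with illustrative dimensions; this is not MOWA's actual architecture, and the point-based classifier and modulation layers shown are assumptions.

```python
import torch
import torch.nn as nn

class TaskPromptModulation(nn.Module):
    """Illustrative task-aware modulation: a point-based classifier predicts the task
    type, and the soft prediction scales/shifts intermediate feature maps."""
    def __init__(self, n_tasks=6, feat_channels=64, n_points=16):
        super().__init__()
        self.classifier = nn.Sequential(              # lightweight point-based task classifier
            nn.Linear(n_points * 2, 64), nn.ReLU(), nn.Linear(64, n_tasks))
        self.to_scale_shift = nn.Linear(n_tasks, 2 * feat_channels)

    def forward(self, points, feats):                 # points: (B, n_points, 2), feats: (B, C, H, W)
        task_logits = self.classifier(points.flatten(1))
        prompt = torch.softmax(task_logits, dim=-1)   # soft task prompt
        scale, shift = self.to_scale_shift(prompt).chunk(2, dim=-1)
        modulated = feats * (1 + scale[:, :, None, None]) + shift[:, :, None, None]
        return modulated, task_logits

# Example: batch of 2, 16 control points, 64-channel feature map
mod, logits = TaskPromptModulation()(torch.randn(2, 16, 2), torch.randn(2, 64, 32, 32))
```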

Large language models (LLMs) have shown remarkable capabilities in many languages beyond English. Yet, LLMs require more inference steps when generating non-English text due to their reliance on English-centric tokenizers, vocabulary, and pre-training data, resulting in higher usage costs for non-English speakers. Vocabulary expansion with target-language tokens is a widely used cross-lingual vocabulary adaptation approach to remedy this issue. Despite its effectiveness in speeding up inference, the majority of previous work has focused on high-resource settings, assuming access to a substantial amount of target-language data to effectively initialize the embeddings of the new tokens and adapt the LLM to the target language. However, vocabulary expansion for LLMs in low-resource settings (i.e., both languages and compute) has yet to be explored. In this paper, we investigate sample-efficient adaptation strategies from different angles, including target vocabulary size, initialization methods, and the amount of target data available for adaptation. Extensive experiments across typologically diverse languages, tasks, and models show that simpler heuristic-based embedding initialization is more efficient and more robust to changes in target vocabulary size and adaptation data in low-resource settings, outperforming a popular random initialization and a more sophisticated state-of-the-art approach that relies on external data and models.
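
A minimal sketch of one such heuristic initialization is shown below, assuming each new target-language token is initialized as the mean of the embeddings of the subwords the original tokenizer splits it into; this is a common heuristic consistent with the description above, and the paper's exact rule may differ.

```python
import numpy as np

def init_new_embeddings(new_tokens, base_tokenizer, base_embeddings):
    """Expand an embedding matrix with heuristically initialized rows for new tokens.

    base_embeddings: (|V_base|, d) array; base_tokenizer(token) -> list of base token ids.
    """
    dim = base_embeddings.shape[1]
    new_rows = np.empty((len(new_tokens), dim), dtype=base_embeddings.dtype)
    for i, tok in enumerate(new_tokens):
        ids = base_tokenizer(tok)
        if ids:
            new_rows[i] = base_embeddings[ids].mean(axis=0)  # mean of subword embeddings
        else:
            new_rows[i] = base_embeddings.mean(axis=0)       # fall back to the global mean
    return np.vstack([base_embeddings, new_rows])            # expanded embedding matrix
```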

Video Frame Interpolation (VFI) has been extensively explored and demonstrated, yet its application to polarization imaging remains largely unexplored. Due to the selective transmission of light by polarization filters, longer exposure times are typically required to ensure sufficient light intensity, which consequently lowers the temporal sampling rate. Furthermore, because the polarization reflected by objects varies with the shooting perspective, focusing solely on estimating pixel displacement is insufficient to accurately reconstruct the intermediate polarization. To tackle these challenges, this study proposes a multi-stage and multi-scale network called Swin-VFI based on the Swin Transformer and introduces a tailored loss function to facilitate the network's understanding of polarization changes. To demonstrate the practicality of the proposed method, this study evaluates its interpolated frames on Shape from Polarization (SfP) and Human Shape Reconstruction tasks, comparing them with other state-of-the-art methods such as CAIN, FLAVR, and VFIT. Experimental results demonstrate our approach's superior reconstruction accuracy across all tasks.
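
To illustrate what a polarization-aware loss can look like, the sketch below assumes a four-angle polarizer setup (0/45/90/135 degrees) so that Stokes parameters, degree of linear polarization (DoLP), and angle of linear polarization (AoLP) can be computed from the four intensity channels; the paper's tailored loss may combine these quantities differently, and the weighting used here is illustrative.

```python
import torch

def stokes(i0, i45, i90, i135, eps=1e-6):
    # Standard linear Stokes parameters from four polarizer-angle intensities
    s0 = 0.5 * (i0 + i45 + i90 + i135)
    s1 = i0 - i90
    s2 = i45 - i135
    dolp = torch.sqrt(s1 ** 2 + s2 ** 2) / (s0 + eps)   # degree of linear polarization
    aolp = 0.5 * torch.atan2(s2, s1)                    # angle of linear polarization
    return s0, dolp, aolp

def polarization_loss(pred, target):
    """pred/target: (B, 4, H, W) intensities at 0/45/90/135 degree filters."""
    l1 = torch.mean(torch.abs(pred - target))           # plain intensity term
    _, p_dolp, p_aolp = stokes(*pred.unbind(1))
    _, t_dolp, t_aolp = stokes(*target.unbind(1))
    pol = torch.mean(torch.abs(p_dolp - t_dolp)) + torch.mean(torch.abs(p_aolp - t_aolp))
    return l1 + 0.5 * pol                                # polarization-term weight is illustrative

loss = polarization_loss(torch.rand(1, 4, 64, 64), torch.rand(1, 4, 64, 64))
```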

Human Activity Recognition (HAR) is a pivotal technique within pattern recognition, dedicated to deciphering human movements and actions from one or multiple sensory inputs. Its significance extends across diverse applications, encompassing monitoring, security protocols, and the development of human-in-the-loop technologies. However, prevailing studies in HAR often overlook the integration of human-centered devices, where distinct parameters and criteria carry different weights than in other applications. Notably, within this realm, curtailing the sensor observation period is of paramount importance to safeguard the efficiency of exoskeletons and prostheses. This study optimizes this observation period specifically for HAR with Inertial Measurement Unit (IMU) sensors. Employing a Deep Convolutional Neural Network (DCNN), the aim is to identify activities from segments of IMU signals spanning durations from 0.1 to 4 seconds. The results point to an optimal observation duration of 0.5 seconds, which yields a classification accuracy of 99.95%. This finding underscores the criticality of precise temporal analysis within HAR, particularly for human-centric devices. It not only enhances our understanding of the optimal observation period but also lays the groundwork for refining the performance and efficacy of devices relied upon for aiding human mobility and functionality.
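
As a concrete picture of the setup, the sketch below slices IMU streams into fixed-duration windows (e.g., the 0.5 s optimum) and classifies each window with a small 1D CNN; the layer sizes and channel counts are illustrative, not the paper's DCNN configuration.

```python
import torch
import torch.nn as nn

def make_windows(signal, sample_rate_hz, window_s=0.5):
    """signal: (channels, samples) tensor -> (num_windows, channels, win_len)."""
    win_len = int(window_s * sample_rate_hz)
    num = signal.shape[1] // win_len
    return signal[:, :num * win_len].reshape(signal.shape[0], num, win_len).permute(1, 0, 2)

class SmallIMUCNN(nn.Module):
    """Toy 1D CNN classifier over fixed-length IMU windows."""
    def __init__(self, channels=6, n_classes=8):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv1d(channels, 32, kernel_size=5, padding=2), nn.ReLU(),
            nn.Conv1d(32, 64, kernel_size=5, padding=2), nn.ReLU(),
            nn.AdaptiveAvgPool1d(1),
        )
        self.classifier = nn.Linear(64, n_classes)

    def forward(self, x):                  # x: (batch, channels, win_len)
        return self.classifier(self.features(x).squeeze(-1))

# Example: 0.5 s windows from a 6-axis IMU sampled at 100 Hz
windows = make_windows(torch.randn(6, 10_000), sample_rate_hz=100, window_s=0.5)
logits = SmallIMUCNN()(windows)
```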

Speech segmentation is an essential part of speech translation (ST) systems in real-world scenarios. Since most ST models are designed to process speech segments, long-form audio must be partitioned into shorter segments before translation. Recently, data-driven approaches for the speech segmentation task have been developed. Although the approaches improve overall translation quality, a performance gap exists due to a mismatch between the models and ST systems. In addition, the prior works require large self-supervised speech models, which consume significant computational resources. In this work, we propose a segmentation model that achieves better speech translation quality with a small model size. We propose an ASR-with-punctuation task as an effective pre-training strategy for the segmentation model. We also show that proper integration of the speech segmentation model into the underlying ST system is critical to improve overall translation quality at inference time.
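
For intuition on how punctuation-aware segmentation can drive long-form splitting, the sketch below cuts at sentence-final punctuation predicted by an ASR-with-punctuation model while keeping each segment under a duration budget; the word-level timestamp interface is an assumption, and the paper's segmentation model operates on audio directly.

```python
SENTENCE_FINAL = (".", "?", "!")

def segment(words, max_len_s=20.0):
    """words: list of (token, start_s, end_s); returns (start, end) segment spans in seconds."""
    segments, seg_start, last_end = [], None, None
    for token, start, end in words:
        if seg_start is None:
            seg_start = start
        too_long = end - seg_start > max_len_s
        if token.endswith(SENTENCE_FINAL) or too_long:   # cut at punctuation or duration budget
            segments.append((seg_start, end))
            seg_start = None
        last_end = end
    if seg_start is not None:                            # flush the trailing partial segment
        segments.append((seg_start, last_end))
    return segments

print(segment([("Hello", 0.0, 0.4), ("world.", 0.5, 0.9), ("How", 1.2, 1.4),
               ("are", 1.5, 1.6), ("you?", 1.7, 2.0)]))
```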

Conventional entity typing approaches are based on independent classification paradigms, which makes it difficult for them to recognize inter-dependent, long-tailed, and fine-grained entity types. In this paper, we argue that the extrinsic and intrinsic dependencies implicitly entailed between labels can provide critical knowledge to tackle the above challenges. To this end, we propose the Label Reasoning Network (LRN), which sequentially reasons over fine-grained entity labels by discovering and exploiting the label dependency knowledge entailed in the data. Specifically, LRN utilizes an auto-regressive network to conduct deductive reasoning and a bipartite attribute graph to conduct inductive reasoning between labels, which can effectively model, learn, and reason about complex label dependencies in a sequence-to-set, end-to-end manner. Experiments show that LRN achieves state-of-the-art performance on standard ultra-fine-grained entity typing benchmarks and can also effectively resolve the long-tail label problem.
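
The sequence-to-set decoding can be pictured as follows: a decoder step proposes the next label conditioned on the mention and the labels emitted so far, stopping at an end symbol, and the emitted sequence is treated as an unordered set. In the sketch below, `decode_step` is a hypothetical stand-in for LRN's deductive-reasoning decoder, and the bipartite attribute graph used for inductive reasoning is not shown.

```python
END = "<end>"

def decode_label_set(mention, decode_step, max_labels=10):
    """Greedy sequence-to-set decoding: emit labels auto-regressively, return them as a set."""
    labels = []
    for _ in range(max_labels):
        nxt = decode_step(mention, labels)   # next label given the mention and previous labels
        if nxt == END:
            break
        if nxt not in labels:                # set semantics: ignore duplicate predictions
            labels.append(nxt)
    return set(labels)

# Toy decoder: always predicts the fixed chain person -> artist -> <end>
chain = {0: "person", 1: "artist", 2: END}
print(decode_label_set("Leonardo", lambda mention, prev: chain[len(prev)]))
```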

Graph Neural Networks (GNNs) have demonstrated superior performance in many challenging applications, including few-shot learning tasks. Despite their powerful capacity to learn and generalize from few samples, GNNs usually suffer from severe over-fitting and over-smoothing as the model becomes deep, which limits model scalability. In this work, we propose a novel Attentive GNN to tackle these challenges by incorporating a triple-attention mechanism, i.e., node self-attention, neighborhood attention, and layer memory attention. We explain why the proposed attentive modules can improve GNNs for few-shot learning with theoretical analysis and illustrations. Extensive experiments show that the proposed Attentive GNN outperforms state-of-the-art GNN-based methods for few-shot learning on the mini-ImageNet and Tiered-ImageNet datasets, in both inductive and transductive settings.
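
As a hedged sketch of just one of the three components, the block below shows scaled dot-product node self-attention over the node features of a small few-shot task graph, with a residual connection; how the neighborhood and layer-memory attentions are built and combined is not reproduced here, and all dimensions are illustrative.

```python
import torch
import torch.nn as nn

class NodeSelfAttention(nn.Module):
    """Scaled dot-product self-attention over node features of a task graph."""
    def __init__(self, dim):
        super().__init__()
        self.q = nn.Linear(dim, dim)
        self.k = nn.Linear(dim, dim)
        self.v = nn.Linear(dim, dim)
        self.scale = dim ** -0.5

    def forward(self, h):                       # h: (num_nodes, dim)
        attn = torch.softmax(self.q(h) @ self.k(h).T * self.scale, dim=-1)
        return h + attn @ self.v(h)             # residual keeps the shallow signal, easing over-smoothing

# Example: 25 nodes (a 5-way 5-shot episode) with 64-d features
out = NodeSelfAttention(64)(torch.randn(25, 64))
```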
