亚洲男人的天堂2018av,欧美草比,久久久久久免费视频精选,国色天香在线看免费,久久久久亚洲av成人片仓井空

<tr id='NDrjM'><strong id='GiCgE'></strong><small id='bsTwb'></small><button id='FILWI'></button><li id='GuDRd'><noscript id='FO2Sx'><big id='0qC8w'></big><dt id='4Bw1W'></dt></noscript></li></tr><ol id='P449K'><option id='Q6LqV'><table id='fzoUH'><blockquote id='saJfh'><tbody id='2rUTE'></tbody></blockquote></table></option></ol><u id='UZVah'></u><kbd id='emVa4'><kbd id='MBEhw'></kbd></kbd>

<code id='Ltr9k'><strong id='facW5'></strong></code>

<fieldset id='xaZDD'></fieldset>

<span id='MwV1a'></span>

<ins id='1ZhOz'></ins>

<acronym id='C1CIo'><em id='fhZlZ'></em><td id='uV6EE'><div id='TyhjU'></div></td></acronym><address id='QF6bK'><big id='ERr8o'><big id='s4wLw'></big><legend id='zGKDi'></legend></big></address>

<i id='reac8'><div id='kD4kS'><ins id='vzIIl'></ins></div></i>

<i id='Cibgl'></i>

·

生成式人工智能 · MoDELS · AI · AIM · Integration ·

2024 年 2 月 13 日

Computational Copyright: Towards A Royalty Model for Music Generative AI

Junwei Deng,Jiaqi Ma

The advancement of generative AI has given rise to pressing copyright challenges, particularly in music industry. This paper focuses on the economic aspects of these challenges, emphasizing that the economic impact constitutes a central issue in the copyright arena. The complexity of the black-box generative AI technologies not only suggests but necessitates algorithmic solutions. However, such solutions have been largely missing, leading to regulatory challenges in this landscape. We aim to bridge the gap in current approaches by proposing potential royalty models for revenue sharing on AI music generation platforms. Our methodology involves a detailed analysis of existing royalty models in platforms like Spotify and YouTube, and adapting these to the unique context of AI-generated music. A significant challenge we address is the attribution of AI-generated music to influential copyrighted content in the training data. To this end, we present algorithmic solutions employing data attribution techniques. Our experimental results verify the effectiveness of these solutions. This research represents a pioneering effort in integrating technical advancements with economic and legal considerations in the field of generative AI, offering a computational copyright solution for the challenges posed by the opaque nature of AI technologies.

相關內容

生成式人工智能

生成式人工智能

生成式人工智能是利用復雜的算法、模型和規則，從大規模數據集中學習，以創造新的原創內容的人工智能技術。這項技術能夠創造文本、圖片、聲音、視頻和代碼等多種類型的內容，全面超越了傳統軟件的數據處理和分析能力。2022年末，OpenAI推出的ChatGPT標志著這一技術在文本生成領域取得了顯著進展，2023年被稱為生成式人工智能的突破之年。這項技術從單一的語言生成逐步向多模態、具身化快速發展。在圖像生成方面，生成系統在解釋提示和生成逼真輸出方面取得了顯著的進步。同時，視頻和音頻的生成技術也在迅速發展，這為虛擬現實和元宇宙的實現提供了新的途徑。生成式人工智能技術在各行業、各領域都具有廣泛的應用前景。

MoDELS · 回合 · Performer · CASES · 數據集 ·

2024 年 3 月 26 日

Synthesizing Soundscapes: Leveraging Text-to-Audio Models for Environmental Sound Classification

Francesca Ronchini,Luca Comanducci,Fabio Antonacci

from arxiv, Submitted to EUSIPCO 2024

In the past few years, text-to-audio models have emerged as a significant advancement in automatic audio gener- ation. Although they represent impressive technological progress, the effectiveness of their use in the development of audio applications remains uncertain. This paper aims to investigate these aspects, specifically focusing on the task of classification of environmental sounds. This study analyzes the performance of two different environmental classification systems when data generated from text-to-audio models is used for training. Two cases are considered: a) when the training dataset is augmented by data coming from two different text-to-audio models; and b) when the training dataset consists solely of synthetic audio generated. In both cases, the performance of the classification task is tested on real data. Results indicate that text-to-audio models are effective for dataset augmentation, whereas the performance of the models drops when relying on only generated audio.

CCS · 控制器 · 估計/估計量 · Fences · 講稿 ·

2024 年 3 月 26 日

Brokenwire : Wireless Disruption of CCS Electric Vehicle Charging

Sebastian K?hler,Richard Baker,Martin Strohmeier,Ivan Martinovic

We present a novel attack against the Combined Charging System, one of the most widely used DC rapid charging technologies for electric vehicles (EVs). Our attack, Brokenwire, interrupts necessary control communication between the vehicle and charger, causing charging sessions to abort. The attack requires only temporary physical proximity and can be conducted wirelessly from a distance, allowing individual vehicles or entire fleets to be disrupted stealthily and simultaneously. In addition, it can be mounted with off-the-shelf radio hardware and minimal technical knowledge. By exploiting CSMA/CA behavior, only a very weak signal needs to be induced into the victim to disrupt communication - exceeding the effectiveness of broadband noise jamming by three orders of magnitude. The exploited behavior is a required part of the HomePlug Green PHY, DIN 70121 & ISO 15118 standards and all known implementations exhibit it. We first study the attack in a controlled testbed and then demonstrate it against eight vehicles and 20 chargers in real deployments. We find the attack to be successful in the real world, at ranges up to 47 m, for a power budget of less than 1 W. We further show that the attack can work between the floors of a building (e.g., multi-story parking), through perimeter fences, and from `drive-by' attacks. We present a heuristic model to estimate the number of vehicles that can be attacked simultaneously for a given output power. Brokenwire has immediate implications for a substantial proportion of the around 12 million battery EVs on the roads worldwide - and profound effects on the new wave of electrification for vehicle fleets, both for private enterprise and crucial public services, as well as electric buses, trucks and small ships. As such, we conducted a disclosure to the industry and discussed a range of mitigation techniques that could be deployed to limit the impact.

模型評估 · MoDELS · 機器學習建模 · 數據集 · ML ·

2024 年 3 月 26 日

FedCSD: A Federated Learning Based Approach for Code-Smell Detection

Sadi Alawadi,Khalid Alkharabsheh,Fahed Alkhabbas,Victor Kebande,Feras M. Awaysheh,Fabio Palomba,Mohammed Awad

from arxiv, 17 pages, 7 figures, Journal paper

This paper proposes a Federated Learning Code Smell Detection (FedCSD) approach that allows organizations to collaboratively train federated ML models while preserving their data privacy. These assertions have been supported by three experiments that have significantly leveraged three manually validated datasets aimed at detecting and examining different code smell scenarios. In experiment 1, which was concerned with a centralized training experiment, dataset two achieved the lowest accuracy (92.30%) with fewer smells, while datasets one and three achieved the highest accuracy with a slight difference (98.90% and 99.5%, respectively). This was followed by experiment 2, which was concerned with cross-evaluation, where each ML model was trained using one dataset, which was then evaluated over the other two datasets. Results from this experiment show a significant drop in the model's accuracy (lowest accuracy: 63.80\%) where fewer smells exist in the training dataset, which has a noticeable reflection (technical debt) on the model's performance. Finally, the last and third experiments evaluate our approach by splitting the dataset into 10 companies. The ML model was trained on the company's site, then all model-updated weights were transferred to the server. Ultimately, an accuracy of 98.34% was achieved by the global model that has been trained using 10 companies for 100 training rounds. The results reveal a slight difference in the global model's accuracy compared to the highest accuracy of the centralized model, which can be ignored in favour of the global model's comprehensive knowledge, lower training cost, preservation of data privacy, and avoidance of the technical debt problem.

Networking · Extensibility · Guidance · 相關系數 · 塊 ·

2024 年 3 月 26 日

Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives

Ronghui Li,YuXiang Zhang,Yachao Zhang,Hongwen Zhang,Jie Guo,Yan Zhang,Yebin Liu,Xiu Li

from arxiv, Accepted by CVPR2024, Project page: //li-ronghui.github.io/lodge

We propose Lodge, a network capable of generating extremely long dance sequences conditioned on given music. We design Lodge as a two-stage coarse to fine diffusion architecture, and propose the characteristic dance primitives that possess significant expressiveness as intermediate representations between two diffusion models. The first stage is global diffusion, which focuses on comprehending the coarse-level music-dance correlation and production characteristic dance primitives. In contrast, the second-stage is the local diffusion, which parallelly generates detailed motion sequences under the guidance of the dance primitives and choreographic rules. In addition, we propose a Foot Refine Block to optimize the contact between the feet and the ground, enhancing the physical realism of the motion. Our approach can parallelly generate dance sequences of extremely long length, striking a balance between global choreographic patterns and local motion quality and expressiveness. Extensive experiments validate the efficacy of our method.

變換 · 講稿 · HTTPS · 向量化 · 端到端 ·

2024 年 3 月 25 日

Chart4Blind: An Intelligent Interface for Chart Accessibility Conversion

Omar Moured,Morris Baumgarten-Egemole,Alina Roitberg,Karin Muller,Thorsten Schwarz,Rainer Stiefelhagen

from arxiv, Accepted to IUI 2024. 19 pages, 7 figures, 2 table. For a demo video, see this //moured.github.io/chart4blind/ . The source code is available at //github.com/moured/chart4blind_code/

In a world driven by data visualization, ensuring the inclusive accessibility of charts for Blind and Visually Impaired (BVI) individuals remains a significant challenge. Charts are usually presented as raster graphics without textual and visual metadata needed for an equivalent exploration experience for BVI people. Additionally, converting these charts into accessible formats requires considerable effort from sighted individuals. Digitizing charts with metadata extraction is just one aspect of the issue; transforming it into accessible modalities, such as tactile graphics, presents another difficulty. To address these disparities, we propose Chart4Blind, an intelligent user interface that converts bitmap image representations of line charts into universally accessible formats. Chart4Blind achieves this transformation by generating Scalable Vector Graphics (SVG), Comma-Separated Values (CSV), and alternative text exports, all comply with established accessibility standards. Through interviews and a formal user study, we demonstrate that even inexperienced sighted users can make charts accessible in an average of 4 minutes using Chart4Blind, achieving a System Usability Scale rating of 90%. In comparison to existing approaches, Chart4Blind provides a comprehensive solution, generating end-to-end accessible SVGs suitable for assistive technologies such as embossed prints (papers and laser cut), 2D tactile displays, and screen readers. For additional information, including open-source codes and demos, please visit our project page //moured.github.io/chart4blind/.

覆蓋 · Analysis · 大語言模型 · Branch · 中位數 ·

2024 年 3 月 24 日

CoverUp: Coverage-Guided LLM-Based Test Generation

Juan Altmayer Pizzorno,Emery D. Berger

from arxiv, 11 pages

This paper presents CoverUp, a novel system that drives the generation of high-coverage Python regression tests via a combination of coverage analysis and large-language models (LLMs). CoverUp iteratively improves coverage, interleaving coverage analysis with dialogs with the LLM to focus its attention on as yet uncovered lines and branches. The resulting test suites significantly improve coverage over the current state of the art: compared to CodaMosa, a hybrid LLM / search-based software testing system, CoverUp substantially improves coverage across the board. On a per-module basis, CoverUp achieves median line coverage of 81% (vs. 62%), branch coverage of 53% (vs. 35%) and line+branch coverage of 78% (vs. 55%). We show that CoverUp's iterative, coverage-guided approach is crucial to its effectiveness, contributing to nearly half of its successes.

LIDAR · SLAM · 估計/估計量 · 損失函數（機器學習） · 泛函 ·

2024 年 3 月 23 日

LONER: LiDAR Only Neural Representations for Real-Time SLAM

Seth Isaacson,Pou-Chun Kung,Mani Ramanagopal,Ram Vasudevan,Katherine A. Skinner

from arxiv, First two authors equally contributed. Webpage: //umautobots.github.io/loner

This paper proposes LONER, the first real-time LiDAR SLAM algorithm that uses a neural implicit scene representation. Existing implicit mapping methods for LiDAR show promising results in large-scale reconstruction, but either require groundtruth poses or run slower than real-time. In contrast, LONER uses LiDAR data to train an MLP to estimate a dense map in real-time, while simultaneously estimating the trajectory of the sensor. To achieve real-time performance, this paper proposes a novel information-theoretic loss function that accounts for the fact that different regions of the map may be learned to varying degrees throughout online training. The proposed method is evaluated qualitatively and quantitatively on two open-source datasets. This evaluation illustrates that the proposed loss function converges faster and leads to more accurate geometry reconstruction than other loss functions used in depth-supervised neural implicit frameworks. Finally, this paper shows that LONER estimates trajectories competitively with state-of-the-art LiDAR SLAM methods, while also producing dense maps competitive with existing real-time implicit mapping methods that use groundtruth poses.

Microsoft Surface · 塑造 · MoDELS · Learning · 潛在 ·

2024 年 3 月 23 日

Surf-D: Generating High-Quality Surfaces of Arbitrary Topologies Using Diffusion Models

Zhengming Yu,Zhiyang Dou,Xiaoxiao Long,Cheng Lin,Zekun Li,Yuan Liu,Norman Müller,Taku Komura,Marc Habermann,Christian Theobalt,Xin Li,Wenping Wang

from arxiv, Project Page: //yzmblog.github.io/projects/SurfD/

We present Surf-D, a novel method for generating high-quality 3D shapes as Surfaces with arbitrary topologies using Diffusion models. Previous methods explored shape generation with different representations and they suffer from limited topologies and poor geometry details. To generate high-quality surfaces of arbitrary topologies, we use the Unsigned Distance Field (UDF) as our surface representation to accommodate arbitrary topologies. Furthermore, we propose a new pipeline that employs a point-based AutoEncoder to learn a compact and continuous latent space for accurately encoding UDF and support high-resolution mesh extraction. We further show that our new pipeline significantly outperforms the prior approaches to learning the distance fields, such as the grid-based AutoEncoder, which is not scalable and incapable of learning accurate UDF. In addition, we adopt a curriculum learning strategy to efficiently embed various surfaces. With the pretrained shape latent space, we employ a latent diffusion model to acquire the distribution of various shapes. Extensive experiments are presented on using Surf-D for unconditional generation, category conditional generation, image conditional generation, and text-to-shape tasks. The experiments demonstrate the superior performance of Surf-D in shape generation across multiple modalities as conditions. Visit our project page at //yzmblog.github.io/projects/SurfD/.

MoDELS · 可理解性 · Performer · 任務對話系統 · 縮放 ·

2024 年 3 月 22 日

InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding

Yi Wang,Kunchang Li,Xinhao Li,Jiashuo Yu,Yinan He,Guo Chen,Baoqi Pei,Rongkun Zheng,Jilan Xu,Zun Wang,Yansong Shi,Tianxiang Jiang,Songze Li,Hongjie Zhang,Yifei Huang,Yu Qiao,Yali Wang,Limin Wang

from arxiv, a technical report about video understanding

We introduce InternVideo2, a new video foundation model (ViFM) that achieves the state-of-the-art performance in action recognition, video-text tasks, and video-centric dialogue. Our approach employs a progressive training paradigm that unifies the different self- or weakly-supervised learning frameworks of masked video token reconstruction, cross-modal contrastive learning, and next token prediction. Different training stages would guide our model to capture different levels of structure and semantic information through different pretext tasks. At the data level, we prioritize the spatiotemporal consistency by semantically segmenting videos and generating video-audio-speech captions. This improves the alignment between video and text. We scale both data and model size for our InternVideo2. Through extensive experiments, we validate our designs and demonstrate the state-of-the-art performance on over 60 video and audio tasks. Notably, our model outperforms others on various video-related captioning, dialogue, and long video understanding benchmarks, highlighting its ability to reason and comprehend long temporal contexts. Code and models are available at //github.com/OpenGVLab/InternVideo2/.

INTERACT · MoDELS · Stable Diffusion · 詞元分析器 · 模型評估 ·

2024 年 3 月 22 日

Beyond Inserting: Learning Identity Embedding for Semantic-Fidelity Personalized Diffusion Generation

Yang Li,Songlin Yang,Wei Wang,Jing Dong

from arxiv, 14 pages, 16 figures

Advanced diffusion-based Text-to-Image (T2I) models, such as the Stable Diffusion Model, have made significant progress in generating diverse and high-quality images using text prompts alone. However, when non-famous users require personalized image generation for their identities (IDs), the T2I models fail to accurately generate their ID-related images. The main problem is that pre-trained T2I models do not learn the mapping between the new ID prompts and their corresponding visual content. The previous methods either failed to accurately fit the face region or lost the interactive generative ability with other existing concepts in T2I models. In other words, they are unable to generate T2I-aligned and semantic-fidelity images for the given prompts with other concepts such as scenes (``Eiffel Tower''), actions (``holding a basketball''), and facial attributes (``eyes closed''). In this paper, we focus on inserting accurate and interactive ID embedding into the Stable Diffusion Model for semantic-fidelity personalized generation. We address this challenge from two perspectives: face-wise region fitting and semantic-fidelity token optimization. Specifically, we first visualize the attention overfit problem and propose a face-wise attention loss to fit the face region instead of entangling ID-unrelated information, such as face layout and background. This key trick significantly enhances the ID accuracy and interactive generative ability with other existing concepts. Then, we optimize one ID representation as multiple per-stage tokens where each token contains two disentangled features. This expansion of the textual conditioning space improves semantic-fidelity control. Extensive experiments validate that our results exhibit superior ID accuracy, text-based manipulation ability, and generalization compared to previous methods.

閱讀: 0 點贊: 0

小貼士

登錄享

相關主題

生成式人工智能

北京阿比特科技有限公司

注冊地址：北京市海淀區羊坊店路18號2幢3層301-191

<dir id='DL6bI'><del id='aUkAm'><del id='DpRj8'></del><pre id='0vDi5'><pre id='CsuNq'><option id='0Yjg2'><address id='dCRd9'></address><bdo id='cnALT'><tr id='IMKY8'><acronym id='HqBBl'><pre id='xr7JT'></pre></acronym><div id='lrrVJ'></div></tr></bdo></option></pre><small id='lLdIj'><address id='bINk5'><u id='V63zw'><legend id='ZRFQT'><option id='SwS6E'><abbr id='VGhuX'></abbr><li id='V7PcH'><pre id='8IbP7'></pre></li></option></legend><select id='9MrGA'></select></u></address></small></pre></del><sup id='CzHn4'></sup><blockquote id='DWKb1'><dt id='Y3iey'></dt></blockquote><blockquote id='EAtwj'></blockquote></dir><tt id='YqZ2y'></tt><u id='rPZTU'><tt id='felEb'><form id='v05Au'></form></tt><td id='0vNlT'><dt id='fz8rB'></dt></td></u>

<code id='3VpP4'><i id='9Xbwe'><q id='zcU32'><legend id='lNSdF'><pre id='MGRjU'><style id='90gN4'><acronym id='qPPAf'><i id='m4tuN'><form id='WDzhV'><option id='78M8L'><center id='fo7LY'></center></option></form></i></acronym></style><tt id='veg55'></tt></pre></legend></q></i></code><center id='0RyDX'></center>

<dd id='NxucT'></dd>

<style id='doPvq'></style><sub id='Otzke'><dfn id='IGrYS'><abbr id='YRpsz'><big id='c23mf'><bdo id='fZ8s2'></bdo></big></abbr></dfn></sub>_{<dir id='Zvgsx'></dir>}