亚洲男人的天堂2018av,欧美草比,久久久久久免费视频精选,国色天香在线看免费,久久久久亚洲av成人片仓井空

<tfoot id='joea9'></tfoot>

<legend id='joea9'><style id='joea9'><dir id='joea9'><q id='joea9'></q></dir></style></legend>

<i id='joea9'><tr id='joea9'><dt id='joea9'><q id='joea9'><span id='joea9'><b id='joea9'><form id='joea9'><ins id='joea9'></ins><ul id='joea9'></ul><sub id='joea9'></sub></form><legend id='joea9'></legend><bdo id='joea9'><pre id='joea9'><center id='joea9'></center></pre></bdo></b><th id='joea9'></th></span></q></dt></tr></i><div id='joea9'><tfoot id='joea9'></tfoot><dl id='joea9'><fieldset id='joea9'></fieldset></dl></div>

·

HTTPS · SimPLe · 三維重建 · Networking · 樣例 ·

2023 年 6 月 23 日

LightGlue: Local Feature Matching at Light Speed

Philipp Lindenberger,Paul-Edouard Sarlin,Marc Pollefeys

We introduce LightGlue, a deep neural network that learns to match local features across images. We revisit multiple design decisions of SuperGlue, the state of the art in sparse matching, and derive simple but effective improvements. Cumulatively, they make LightGlue more efficient - in terms of both memory and computation, more accurate, and much easier to train. One key property is that LightGlue is adaptive to the difficulty of the problem: the inference is much faster on image pairs that are intuitively easy to match, for example because of a larger visual overlap or limited appearance change. This opens up exciting prospects for deploying deep matchers in latency-sensitive applications like 3D reconstruction. The code and trained models are publicly available at //github.com/cvg/LightGlue.

相關內容

HTTPS

超文本傳(chuan)輸安全(quan)協(xie)議(yi)是超文本傳(chuan)輸協(xie)議(yi)和 SSL/TLS 的組合(he)，用以提供加(jia)密通(tong)訊(xun)及對網絡服(fu)務器身份(fen)的鑒(jian)定。

Learning · Backbone · MoDELS · 原點 · 樣例 ·

2023 年 8 月 16 日

EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual Backbones

Yulin Wang,Yang Yue,Rui Lu,Tianjiao Liu,Zhao Zhong,Shiji Song,Gao Huang

from arxiv, ICCV 2023

The superior performance of modern deep networks usually comes with a costly training procedure. This paper presents a new curriculum learning approach for the efficient training of visual backbones (e.g., vision Transformers). Our work is inspired by the inherent learning dynamics of deep networks: we experimentally show that at an earlier training stage, the model mainly learns to recognize some 'easier-to-learn' discriminative patterns within each example, e.g., the lower-frequency components of images and the original information before data augmentation. Driven by this phenomenon, we propose a curriculum where the model always leverages all the training data at each epoch, while the curriculum starts with only exposing the 'easier-to-learn' patterns of each example, and introduces gradually more difficult patterns. To implement this idea, we 1) introduce a cropping operation in the Fourier spectrum of the inputs, which enables the model to learn from only the lower-frequency components efficiently, 2) demonstrate that exposing the features of original images amounts to adopting weaker data augmentation, and 3) integrate 1) and 2) and design a curriculum learning schedule with a greedy-search algorithm. The resulting approach, EfficientTrain, is simple, general, yet surprisingly effective. As an off-the-shelf method, it reduces the wall-time training cost of a wide variety of popular models (e.g., ResNet, ConvNeXt, DeiT, PVT, Swin, and CSWin) by >1.5x on ImageNet-1K/22K without sacrificing accuracy. It is also effective for self-supervised learning (e.g., MAE). Code is available at //github.com/LeapLabTHU/EfficientTrain.

正則的 · Processing（編程語言） · 優化器 · HTTPS · 正則化項 ·

2023 年 8 月 15 日

CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

Hao Ouyang,Qiuyu Wang,Yuxi Xiao,Qingyan Bai,Juntao Zhang,Kecheng Zheng,Xiaowei Zhou,Qifeng Chen,Yujun Shen

from arxiv, Project Webpage: //qiuyu96.github.io/CoDeF/, Code: //github.com/qiuyu96/CoDeF

We present the content deformation field CoDeF as a new type of video representation, which consists of a canonical content field aggregating the static contents in the entire video and a temporal deformation field recording the transformations from the canonical image (i.e., rendered from the canonical content field) to each individual frame along the time axis.Given a target video, these two fields are jointly optimized to reconstruct it through a carefully tailored rendering pipeline.We advisedly introduce some regularizations into the optimization process, urging the canonical content field to inherit semantics (e.g., the object shape) from the video.With such a design, CoDeF naturally supports lifting image algorithms for video processing, in the sense that one can apply an image algorithm to the canonical image and effortlessly propagate the outcomes to the entire video with the aid of the temporal deformation field.We experimentally show that CoDeF is able to lift image-to-image translation to video-to-video translation and lift keypoint detection to keypoint tracking without any training.More importantly, thanks to our lifting strategy that deploys the algorithms on only one image, we achieve superior cross-frame consistency in processed videos compared to existing video-to-video translation approaches, and even manage to track non-rigid objects like water and smog.Project page can be found at //qiuyu96.github.io/CoDeF/.

Color · 解碼 · Extensibility · state-of-the-art · Performer ·

2023 年 8 月 15 日

DDColor: Towards Photo-Realistic Image Colorization via Dual Decoders

Xiaoyang Kang,Tao Yang,Wenqi Ouyang,Peiran Ren,Lingzhi Li,Xuansong Xie

from arxiv, ICCV 2023; Code: //github.com/piddnad/DDColor

Image colorization is a challenging problem due to multi-modal uncertainty and high ill-posedness. Directly training a deep neural network usually leads to incorrect semantic colors and low color richness. While transformer-based methods can deliver better results, they often rely on manually designed priors, suffer from poor generalization ability, and introduce color bleeding effects. To address these issues, we propose DDColor, an end-to-end method with dual decoders for image colorization. Our approach includes a pixel decoder and a query-based color decoder. The former restores the spatial resolution of the image, while the latter utilizes rich visual features to refine color queries, thus avoiding hand-crafted priors. Our two decoders work together to establish correlations between color and multi-scale semantic representations via cross-attention, significantly alleviating the color bleeding effect. Additionally, a simple yet effective colorfulness loss is introduced to enhance the color richness. Extensive experiments demonstrate that DDColor achieves superior performance to existing state-of-the-art works both quantitatively and qualitatively. The codes and models are publicly available at //github.com/piddnad/DDColor.

MoDELS · 優化器 · 推斷 · Extensibility · 簇 ·

2023 年 8 月 14 日

Symphony: Optimized Model Serving using Centralized Orchestration

Lequn Chen,Weixin Deng,Anirudh Canumalla,Yu Xin,Matthai Philipose,Arvind Krishnamurthy

The orchestration of deep neural network (DNN) model inference on GPU clusters presents two significant challenges: achieving high accelerator efficiency given the batching properties of model inference while meeting latency service level objectives (SLOs), and adapting to workload changes both in terms of short-term fluctuations and long-term resource allocation. To address these challenges, we propose Symphony, a centralized scheduling system that can scale to millions of requests per second and coordinate tens of thousands of GPUs. Our system utilizes a non-work-conserving scheduling algorithm capable of achieving high batch efficiency while also enabling robust autoscaling. Additionally, we developed an epoch-scale algorithm that allocates models to sub-clusters based on the compute and memory needs of the models. Through extensive experiments, we demonstrate that Symphony outperforms prior systems by up to 4.7x higher goodput.

MoDELS · Vision · 去噪 · 前向 · 可辨認的 ·

2022 年 9 月 10 日

Diffusion Models in Vision: A Survey

Florinel-Alin Croitoru,Vlad Hondru,Radu Tudor Ionescu,Mubarak Shah

from arxiv, 20 pages, 3 figures

Denoising diffusion models represent a recent emerging topic in computer vision, demonstrating remarkable results in the area of generative modeling. A diffusion model is a deep generative model that is based on two stages, a forward diffusion stage and a reverse diffusion stage. In the forward diffusion stage, the input data is gradually perturbed over several steps by adding Gaussian noise. In the reverse stage, a model is tasked at recovering the original input data by learning to gradually reverse the diffusion process, step by step. Diffusion models are widely appreciated for the quality and diversity of the generated samples, despite their known computational burdens, i.e. low speeds due to the high number of steps involved during sampling. In this survey, we provide a comprehensive review of articles on denoising diffusion models applied in vision, comprising both theoretical and practical contributions in the field. First, we identify and present three generic diffusion modeling frameworks, which are based on denoising diffusion probabilistic models, noise conditioned score networks, and stochastic differential equations. We further discuss the relations between diffusion models and other deep generative models, including variational auto-encoders, generative adversarial networks, energy-based models, autoregressive models and normalizing flows. Then, we introduce a multi-perspective categorization of diffusion models applied in computer vision. Finally, we illustrate the current limitations of diffusion models and envision some interesting directions for future research.

Extensibility · 噪聲 · Performer · state-of-the-art · 學成 ·

2021 年 6 月 30 日

Affective Image Content Analysis: Two Decades Review and New Perspectives

Sicheng Zhao,Xingxu Yao,Jufeng Yang,Guoli Jia,Guiguang Ding,Tat-Seng Chua,Bj?rn W. Schuller,Kurt Keutzer

from arxiv, Accepted by IEEE TPAMI

Images can convey rich semantics and induce various emotions in viewers. Recently, with the rapid advancement of emotional intelligence and the explosive growth of visual data, extensive research efforts have been dedicated to affective image content analysis (AICA). In this survey, we will comprehensively review the development of AICA in the recent two decades, especially focusing on the state-of-the-art methods with respect to three main challenges -- the affective gap, perception subjectivity, and label noise and absence. We begin with an introduction to the key emotion representation models that have been widely employed in AICA and description of available datasets for performing evaluation with quantitative comparison of label noise and dataset bias. We then summarize and compare the representative approaches on (1) emotion feature extraction, including both handcrafted and deep features, (2) learning methods on dominant emotion recognition, personalized emotion prediction, emotion distribution learning, and learning from noisy data or few labels, and (3) AICA based applications. Finally, we discuss some challenges and promising research directions in the future, such as image content and context understanding, group emotion clustering, and viewer-image interaction.

Neural Networks · 圖 · Networks · Taxonomy · 圖神經網絡 ·

2020 年 12 月 16 日

Graph Neural Networks: Taxonomy, Advances and Trends

Yu Zhou,Haixia Zheng,Xin Huang

from arxiv, 50 pages, 7 figures

Graph neural networks provide a powerful toolkit for embedding real-world graphs into low-dimensional spaces according to specific tasks. Up to now, there have been several surveys on this topic. However, they usually lay emphasis on different angles so that the readers can not see a panorama of the graph neural networks. This survey aims to overcome this limitation, and provide a comprehensive review on the graph neural networks. First of all, we provide a novel taxonomy for the graph neural networks, and then refer to up to 400 relevant literatures to show the panorama of the graph neural networks. All of them are classified into the corresponding categories. In order to drive the graph neural networks into a new stage, we summarize four future research directions so as to overcome the facing challenges. It is expected that more and more scholars can understand and exploit the graph neural networks, and use them in their research community.

圖 · 知識圖譜 · 鏈路預測 · Extensibility · entity ·

2020 年 10 月 6 日

CoDEx: A Comprehensive Knowledge Graph Completion Benchmark

Tara Safavi,Danai Koutra

from arxiv, EMNLP 2020

We present CoDEx, a set of knowledge graph completion datasets extracted from Wikidata and Wikipedia that improve upon existing knowledge graph completion benchmarks in scope and level of difficulty. In terms of scope, CoDEx comprises three knowledge graphs varying in size and structure, multilingual descriptions of entities and relations, and tens of thousands of hard negative triples that are plausible but verified to be false. To characterize CoDEx, we contribute thorough empirical analyses and benchmarking experiments. First, we analyze each CoDEx dataset in terms of logical relation patterns. Next, we report baseline link prediction and triple classification results on CoDEx for five extensively tuned embedding models. Finally, we differentiate CoDEx from the popular FB15K-237 knowledge graph completion dataset by showing that CoDEx covers more diverse and interpretable content, and is a more difficult link prediction benchmark. Data, code, and pretrained models are available at //bit.ly/2EPbrJs.

contrastive · Performer · 無監督 · 控制器 · 學成 ·

2020 年 4 月 28 日

CURL: Contrastive Unsupervised Representations for Reinforcement Learning

Aravind Srinivas,Michael Laskin,Pieter Abbeel

from arxiv, First two authors contributed equally, website: //mishalaskin.github.io/curl code: //github.com/MishaLaskin/curl

We present CURL: Contrastive Unsupervised Representations for Reinforcement Learning. CURL extracts high-level features from raw pixels using contrastive learning and performs off-policy control on top of the extracted features. CURL outperforms prior pixel-based methods, both model-based and model-free, on complex tasks in the DeepMind Control Suite and Atari Games showing 1.9x and 1.6x performance gains at the 100K environment and interaction steps benchmarks respectively. On the DeepMind Control Suite, CURL is the first image-based algorithm to nearly match the sample-efficiency and performance of methods that use state-based features.

Capsule · CapsNet · Networking · 判別器 · MoDELS ·

2018 年 2 月 17 日

CapsuleGAN: Generative Adversarial Capsule Network

Ayush Jaiswal,Wael AbdAlmageed,Premkumar Natarajan

We present Generative Adversarial Capsule Network (CapsuleGAN), a framework that uses capsule networks (CapsNets) instead of the standard convolutional neural networks (CNNs) as discriminators within the generative adversarial network (GAN) setting, while modeling image data. We provide guidelines for designing CapsNet discriminators and the updated GAN objective function, which incorporates the CapsNet margin loss, for training CapsuleGAN models. We show that CapsuleGAN outperforms convolutional-GAN at modeling image data distribution on the MNIST dataset of handwritten digits, evaluated on the generative adversarial metric and at semi-supervised image classification.

閱讀: 0 點贊: 0

小貼士

登錄享

相關主題

北京阿比特科技有限公司

注冊地址：北京市海淀區羊坊店路18號2幢3層301-191

<tfoot id='joea9'></tfoot>

<legend id='joea9'><style id='joea9'><dir id='joea9'><q id='joea9'></q></dir></style></legend>

<i id='joea9'><tr id='joea9'><dt id='joea9'><q id='joea9'><span id='joea9'><b id='joea9'><form id='joea9'><ins id='joea9'></ins><ul id='joea9'></ul><sub id='joea9'></sub></form><legend id='joea9'></legend><bdo id='joea9'><pre id='joea9'><center id='joea9'></center></pre></bdo></b><th id='joea9'></th></span></q></dt></tr></i><div id='joea9'><tfoot id='joea9'></tfoot><dl id='joea9'><fieldset id='joea9'></fieldset></dl></div>