亚洲男人的天堂2018av,欧美草比,久久久久久免费视频精选,国色天香在线看免费,久久久久亚洲av成人片仓井空

<tr id='KQxQu'><strong id='b1hXw'></strong><small id='6vf4G'></small><button id='bu547'></button><li id='JUd5L'><noscript id='7sUI7'><big id='GKYrx'></big><dt id='xaydX'></dt></noscript></li></tr><ol id='7Ocmz'><option id='pZg0L'><table id='8U2ha'><blockquote id='ZNDfx'><tbody id='SfJ1Z'></tbody></blockquote></table></option></ol><u id='vHXFi'></u><kbd id='a61fa'><kbd id='5R2Ev'></kbd></kbd>

<code id='Akegy'><strong id='Of7mW'></strong></code>

<fieldset id='iVjSU'></fieldset>

<span id='OnF7J'></span>

<ins id='nj0rW'></ins>

<acronym id='x2FZs'><em id='5HFqJ'></em><td id='YvsVi'><div id='TaCg6'></div></td></acronym><address id='LJBkf'><big id='VAbU2'><big id='XrT8O'></big><legend id='3Rxuz'></legend></big></address>

<i id='58IKf'><div id='M5Kve'><ins id='JelVZ'></ins></div></i>

<i id='ah6Q4'></i>

·

Extensibility · 損失 · MoDELS · state-of-the-art · Performer ·

2023 年 3 月 22 日

Road Extraction with Satellite Images and Partial Road Maps

Qianxiong Xu,Cheng Long,Liang Yu,Chen Zhang

from arxiv, This paper has been accepted by IEEE Transactions on Geoscience and Remote Sensing

Road extraction is a process of automatically generating road maps mainly from satellite images. Existing models all target to generate roads from the scratch despite that a large quantity of road maps, though incomplete, are publicly available (e.g. those from OpenStreetMap) and can help with road extraction. In this paper, we propose to conduct road extraction based on satellite images and partial road maps, which is new. We then propose a two-branch Partial to Complete Network (P2CNet) for the task, which has two prominent components: Gated Self-Attention Module (GSAM) and Missing Part (MP) loss. GSAM leverages a channel-wise self-attention module and a gate module to capture long-range semantics, filter out useless information, and better fuse the features from two branches. MP loss is derived from the partial road maps, trying to give more attention to the road pixels that do not exist in partial road maps. Extensive experiments are conducted to demonstrate the effectiveness of our model, e.g. P2CNet achieves state-of-the-art performance with the IoU scores of 70.71% and 75.52%, respectively, on the SpaceNet and OSM datasets.

相關內容

Extensibility

iOS 8 提供的應用間和應用跟系統的功能交互特性。

Today (iOS and OS X): widgets for the Today view of Notification Center
Share (iOS and OS X): post content to web services or share content with others
Actions (iOS and OS X): app extensions to view or manipulate inside another app
Photo Editing (iOS): edit a photo or video in Apple's Photos app with extensions from a third-party apps
Finder Sync (OS X): remote file storage in the Finder with support for Finder content annotation
Storage Provider (iOS): an interface between files inside an app and other apps on a user's device
Custom Keyboard (iOS): system-wide alternative keyboards

Source:

prototype · Learning · 小樣本學習 · 可辨認的 · 表示學習 ·

2023 年 5 月 10 日

Cross-Domain Few-Shot Relation Extraction via Representation Learning and Domain Adaptation

Zhongju Yuan,Zhenkun Wang,Genghui Li

Few-shot relation extraction aims to recognize novel relations with few labeled sentences in each relation. Previous metric-based few-shot relation extraction algorithms identify relationships by comparing the prototypes generated by the few labeled sentences embedding with the embeddings of the query sentences using a trained metric function. However, as these domains always have considerable differences from those in the training dataset, the generalization ability of these approaches on unseen relations in many domains is limited. Since the prototype is necessary for obtaining relationships between entities in the latent space, we suggest learning more interpretable and efficient prototypes from prior knowledge and the intrinsic semantics of relations to extract new relations in various domains more effectively. By exploring the relationships between relations using prior information, we effectively improve the prototype representation of relations. By using contrastive learning to make the classification margins between sentence embedding more distinct, the prototype's geometric interpretability is enhanced. Additionally, utilizing a transfer learning approach for the cross-domain problem allows the generation process of the prototype to account for the gap between other domains, making the prototype more robust and enabling the better extraction of associations across multiple domains. The experiment results on the benchmark FewRel dataset demonstrate the advantages of the suggested method over some state-of-the-art approaches.

特征空間 · 圖形處理器 · Neural Networks · 圖 · Networking ·

2023 年 5 月 10 日

Feature Expansion for Graph Neural Networks

Jiaqi Sun,Lin Zhang,Guangyi Chen,Kun Zhang,Peng XU,Yujiu Yang

from arxiv, Accepted by ICML'23

Graph neural networks aim to learn representations for graph-structured data and show impressive performance, particularly in node classification. Recently, many methods have studied the representations of GNNs from the perspective of optimization goals and spectral graph theory. However, the feature space that dominates representation learning has not been systematically studied in graph neural networks. In this paper, we propose to fill this gap by analyzing the feature space of both spatial and spectral models. We decompose graph neural networks into determined feature spaces and trainable weights, providing the convenience of studying the feature space explicitly using matrix space analysis. In particular, we theoretically find that the feature space tends to be linearly correlated due to repeated aggregations. Motivated by these findings, we propose 1) feature subspaces flattening and 2) structural principal components to expand the feature space. Extensive experiments verify the effectiveness of our proposed more comprehensive feature space, with comparable inference time to the baseline, and demonstrate its efficient convergence capability.

估計/估計量 · 可理解性 · state-of-the-art · MoDELS · RE ·

2023 年 5 月 10 日

Transformer-based model for monocular visual odometry: a video understanding approach

André O. Fran?ani,Marcos R. O. A. Maximo

Estimating the camera pose given images of a single camera is a traditional task in mobile robots and autonomous vehicles. This problem is called monocular visual odometry and it often relies on geometric approaches that require engineering effort for a specific scenario. Deep learning methods have shown to be generalizable after proper training and a considerable amount of available data. Transformer-based architectures have dominated the state-of-the-art in natural language processing and computer vision tasks, such as image and video understanding. In this work, we deal with the monocular visual odometry as a video understanding task to estimate the 6-DoF camera's pose. We contribute by presenting the TSformer-VO model based on spatio-temporal self-attention mechanisms to extract features from clips and estimate the motions in an end-to-end manner. Our approach achieved competitive state-of-the-art performance compared with geometry-based and deep learning-based methods on the KITTI visual odometry dataset, outperforming the DeepVO implementation highly accepted in the visual odometry community.

Learning · contrastive · prototype · 表示 · 標注 ·

2023 年 5 月 10 日

Towards Effective Visual Representations for Partial-Label Learning

Shiyu Xia,Jiaqi Lv,Ning Xu,Gang Niu,Xin Geng

Under partial-label learning (PLL) where, for each training instance, only a set of ambiguous candidate labels containing the unknown true label is accessible, contrastive learning has recently boosted the performance of PLL on vision tasks, attributed to representations learned by contrasting the same/different classes of entities. Without access to true labels, positive points are predicted using pseudo-labels that are inherently noisy, and negative points often require large batches or momentum encoders, resulting in unreliable similarity information and a high computational overhead. In this paper, we rethink a state-of-the-art contrastive PLL method PiCO[24], inspiring the design of a simple framework termed PaPi (Partial-label learning with a guided Prototypical classifier), which demonstrates significant scope for improvement in representation learning, thus contributing to label disambiguation. PaPi guides the optimization of a prototypical classifier by a linear classifier with which they share the same feature encoder, thus explicitly encouraging the representation to reflect visual similarity between categories. It is also technically appealing, as PaPi requires only a few components in PiCO with the opposite direction of guidance, and directly eliminates the contrastive learning module that would introduce noise and consume computational resources. We empirically demonstrate that PaPi significantly outperforms other PLL methods on various image classification tasks.

MoDELS · Performer · GANs · Networking · Processing（編程語言） ·

2023 年 5 月 10 日

Post-training Model Quantization Using GANs for Synthetic Data Generation

Athanasios Masouris,Mansi Sharma,Adrian Boguszewski,Alexander Kozlov,Zhuo Wu,Raymond Lo

Quantization is a widely adopted technique for deep neural networks to reduce the memory and computational resources required. However, when quantized, most models would need a suitable calibration process to keep their performance intact, which requires data from the target domain, such as a fraction of the dataset used in model training and model validation (i.e. calibration dataset). In this study, we investigate the use of synthetic data as a substitute for the calibration with real data for the quantization method. We propose a data generation method based on Generative Adversarial Networks that are trained prior to the model quantization step. We compare the performance of models quantized using data generated by StyleGAN2-ADA and our pre-trained DiStyleGAN, with quantization using real data and an alternative data generation method based on fractal images. Overall, the results of our experiments demonstrate the potential of leveraging synthetic data for calibration during the quantization process. In our experiments, the percentage of accuracy degradation of the selected models was less than 0.6%, with our best performance achieved on MobileNetV2 (0.05%). The code is available at: //github.com/ThanosM97/gsoc2022-openvino

Learning · contrastive · 對比學習 · 可約的 · HTTPS ·

2023 年 5 月 10 日

Weakly-supervised ROI extraction method based on contrastive learning for remote sensing images

Lingfeng He,Mengze Xu,Jie Ma

ROI extraction is an active but challenging task in remote sensing because of the complicated landform, the complex boundaries and the requirement of annotations. Weakly supervised learning (WSL) aims at learning a mapping from input image to pixel-wise prediction under image-wise labels, which can dramatically decrease the labor cost. However, due to the imprecision of labels, the accuracy and time consumption of WSL methods are relatively unsatisfactory. In this paper, we propose a two-step ROI extraction based on contractive learning. Firstly, we present to integrate multiscale Grad-CAM to obtain pseudo pixelwise annotations with well boundaries. Then, to reduce the compact of misjudgments in pseudo annotations, we construct a contrastive learning strategy to encourage the features inside ROI as close as possible and separate background features from foreground features. Comprehensive experiments demonstrate the superiority of our proposal. Code is available at //github.com/HE-Lingfeng/ROI-Extraction

Mixup · state-of-the-art · 穩健性 · Learning · 損失 ·

2023 年 5 月 7 日

Learning Robust Self-attention Features for Speech Emotion Recognition with Label-adaptive Mixup

Lei Kang,Lichao Zhang,Dazhi Jiang

from arxiv, Accepted to ICASSP 2023

Speech Emotion Recognition (SER) is to recognize human emotions in a natural verbal interaction scenario with machines, which is considered as a challenging problem due to the ambiguous human emotions. Despite the recent progress in SER, state-of-the-art models struggle to achieve a satisfactory performance. We propose a self-attention based method with combined use of label-adaptive mixup and center loss. By adapting label probabilities in mixup and fitting center loss to the mixup training scheme, our proposed method achieves a superior performance to the state-of-the-art methods.

多峰值 · Better · Processing（編程語言） · 單峰值 · INFORMS ·

2020 年 4 月 10 日

Multimodal Categorization of Crisis Events in Social Media

Mahdi Abavisani,Liwei Wu,Shengli Hu,Joel Tetreault,Alejandro Jaimes

from arxiv, Conference on Computer Vision and Pattern Recognition (CVPR 2020)

Recent developments in image classification and natural language processing, coupled with the rapid growth in social media usage, have enabled fundamental advances in detecting breaking events around the world in real-time. Emergency response is one such area that stands to gain from these advances. By processing billions of texts and images a minute, events can be automatically detected to enable emergency response workers to better assess rapidly evolving situations and deploy resources accordingly. To date, most event detection techniques in this area have focused on image-only or text-only approaches, limiting detection performance and impacting the quality of information delivered to crisis response teams. In this paper, we present a new multimodal fusion method that leverages both images and texts as input. In particular, we introduce a cross-attention module that can filter uninformative and misleading components from weak modalities on a sample by sample basis. In addition, we employ a multimodal graph-based approach to stochastically transition between embeddings of different multimodal pairs during training to better regularize the learning process as well as dealing with limited training data by constructing new matched pairs from different samples. We show that our method outperforms the unimodal approaches and strong multimodal baselines by a large margin on three crisis-related tasks.

數據增強 · 泛化理論 · 矩 · 規范化的 · surge ·

2020 年 2 月 25 日

On Feature Normalization and Data Augmentation

Boyi Li,Felix Wu,Ser-Nam Lim,Serge Belongie,Kilian Q. Weinberger

Modern neural network training relies heavily on data augmentation for improved generalization. After the initial success of label-preserving augmentations, there has been a recent surge of interest in label-perturbing approaches, which combine features and labels across training samples to smooth the learned decision surface. In this paper, we propose a new augmentation method that leverages the first and second moments extracted and re-injected by feature normalization. We replace the moments of the learned features of one training image by those of another, and also interpolate the target labels. As our approach is fast, operates entirely in feature space, and mixes different signals than prior methods, one can effectively combine it with existing augmentation methods. We demonstrate its efficacy across benchmark data sets in computer vision, speech, and natural language processing, where it consistently improves the generalization performance of highly competitive baseline networks.

去噪 · 白盒 · 穩健性 · Networking · 模型評估 ·

2018 年 12 月 9 日

Feature Denoising for Improving Adversarial Robustness

Cihang Xie,Yuxin Wu,Laurens van der Maaten,Alan Yuille,Kaiming He

from arxiv, tech report

Adversarial attacks to image classification systems present challenges to convolutional networks and opportunities for understanding them. This study suggests that adversarial perturbations on images lead to noise in the features constructed by these networks. Motivated by this observation, we develop new network architectures that increase adversarial robustness by performing feature denoising. Specifically, our networks contain blocks that denoise the features using non-local means or other filters; the entire networks are trained end-to-end. When combined with adversarial training, our feature denoising networks substantially improve the state-of-the-art in adversarial robustness in both white-box and black-box attack settings. On ImageNet, under 10-iteration PGD white-box attacks where prior art has 27.9% accuracy, our method achieves 55.7%; even under extreme 2000-iteration PGD white-box attacks, our method secures 42.6% accuracy. A network based on our method was ranked first in Competition on Adversarial Attacks and Defenses (CAAD) 2018 --- it achieved 50.6% classification accuracy on a secret, ImageNet-like test dataset against 48 unknown attackers, surpassing the runner-up approach by ~10%. Code and models will be made publicly available.

閱讀: 0 點贊: 0

小貼士

登錄享

相關主題

state-of-the-art

北京阿比特科技有限公司

注冊地址：北京市海淀區羊坊店路18號2幢3層301-191

<tfoot id='m5p5v'></tfoot>

<legend id='m5p5v'><style id='m5p5v'><dir id='m5p5v'><q id='m5p5v'></q></dir></style></legend>

<i id='m5p5v'><tr id='m5p5v'><dt id='m5p5v'><q id='m5p5v'><span id='m5p5v'><b id='m5p5v'><form id='m5p5v'><ins id='m5p5v'></ins><ul id='m5p5v'></ul><sub id='m5p5v'></sub></form><legend id='m5p5v'></legend><bdo id='m5p5v'><pre id='m5p5v'><center id='m5p5v'></center></pre></bdo></b><th id='m5p5v'></th></span></q></dt></tr></i><div id='m5p5v'><tfoot id='m5p5v'></tfoot><dl id='m5p5v'><fieldset id='m5p5v'></fieldset></dl></div>