亚洲男人的天堂2018av,欧美草比,久久久久久免费视频精选,国色天香在线看免费,久久久久亚洲av成人片仓井空

<form id='nVnRQ'></form>

<bdo id='FWaxA'><sup id='JElzC'><div id='GqJI9'><bdo id='bCkUR'></bdo></div></sup></bdo>

·

控制器 · 機器人 · 設計 · state-of-the-art · Performer ·

2023 年 10 月 29 日

Design and Experimental Evaluation of a Haptic Robot-Assisted System for Femur Fracture Surgery

Fayez H. Alruwaili,Michael P. Clancy,Marzieh S. Saeedi-Hosseiny,Jacob A. Logar,Charalampos Papachristou,Christopher Haydel,Javad Parvizi,Iulian I. Iordachita,Mohammad H. Abedin-Nasab

from arxiv, This paper is to be submitted to an IEEE journal

In the face of challenges encountered during femur fracture surgery, such as the high rates of malalignment and X-ray exposure to operating personnel, robot-assisted surgery has emerged as an alternative to conventional state-of-the-art surgical methods. This paper introduces the development of Robossis, a haptic system for robot-assisted femur fracture surgery. Robossis comprises a 7-DOF haptic controller and a 6-DOF surgical robot. A unilateral control architecture is developed to address the kinematic mismatch and the motion transfer between the haptic controller and the Robossis surgical robot. A real-time motion control pipeline is designed to address the motion transfer and evaluated through experimental testing. The analysis illustrates that the Robossis surgical robot can adhere to the desired trajectory from the haptic controller with an average translational error of 0.32 mm and a rotational error of 0.07 deg. Additionally, a haptic rendering pipeline is developed to resolve the kinematic mismatch by constraining the haptic controller (user hand) movement within the permissible joint limits of the Robossis surgical robot. Lastly, in a cadaveric lab test, the Robossis system assisted surgeons during a mock femur fracture surgery. The result shows that Robossis can provide an intuitive solution for surgeons to perform femur fracture surgery.

相關內容

控制器

平穩的 · Processing（編程語言） · Analysis · 服務器 · 近似 ·

2023 年 12 月 16 日

Diffusion Approximations of Speed-Aware Join-the-Shortest-Queue Scheme: Transient and Stationary Analysis

Sanidhay Bhambay,Burak Büke,Arpan Mukhopadhyay

The Join-the-Shortest-Queue (JSQ) load balancing scheme is widely acknowledged for its effectiveness in minimizing the average response time for jobs in systems with identical servers. However, when applied to a heterogeneous server system with servers of different processing speeds, the JSQ scheme exhibits suboptimal performance. Recently, a variation of JSQ called the Speed-Aware-Join-the-Shortest-Queue (SA-JSQ) scheme has been shown to attain fluid limit optimality for systems with heterogeneous servers. In this paper, we examine the SA-JSQ scheme for heterogeneous server systems under the Halfin-Whitt regime. Our analysis begins by establishing that the scaled and centered version of the system state weakly converges to a diffusion process characterized by stochastic integral equations. Furthermore, we prove that the diffusion process is positive recurrent and the sequence of stationary measures for the scaled and centered queue length processes converge to the stationary measure for the limiting diffusion process. To achieve this result, we employ Stein's method with a generator expansion approach.

對數幾率回歸 · Pattern Recognition · Notability · MSR · 描述符 ·

2023 年 12 月 16 日

Fusion of Deep and Shallow Features for Face Kinship Verification

Belabbaci El Ouanas,Khammari Mohammed,Chouchane Ammar,Mohcene Bessaoudi,Abdelmalik Ouamane,Akram Abderraouf Gharbi

from arxiv, arXiv admin note: text overlap with arXiv:2312.03562

Kinship verification from face images is a novel and formidable challenge in the realms of pattern recognition and computer vision. This work makes notable contributions by incorporating a preprocessing technique known as Multiscale Retinex (MSR), which enhances image quality. Our approach harnesses the strength of complementary deep (VGG16) and shallow texture descriptors (BSIF) by combining them at the score level using Logistic Regression (LR) technique. We assess the effectiveness of our approach by conducting comprehensive experiments on three challenging kinship datasets: Cornell Kin Face, UB Kin Face and TS Kin Face

DeepFakes · 生成器網絡 · Analysis · 操作 · Networking ·

2023 年 12 月 16 日

Rethinking the Up-Sampling Operations in CNN-based Generative Network for Generalizable Deepfake Detection

Chuangchuang Tan,Yao Zhao,Shikui Wei,Guanghua Gu,Ping Liu,Yunchao Wei

from arxiv, 10 pages, 4 figures

Recently, the proliferation of highly realistic synthetic images, facilitated through a variety of GANs and Diffusions, has significantly heightened the susceptibility to misuse. While the primary focus of deepfake detection has traditionally centered on the design of detection algorithms, an investigative inquiry into the generator architectures has remained conspicuously absent in recent years. This paper contributes to this lacuna by rethinking the architectures of CNN-based generators, thereby establishing a generalized representation of synthetic artifacts. Our findings illuminate that the up-sampling operator can, beyond frequency-based artifacts, produce generalized forgery artifacts. In particular, the local interdependence among image pixels caused by upsampling operators is significantly demonstrated in synthetic images generated by GAN or diffusion. Building upon this observation, we introduce the concept of Neighboring Pixel Relationships(NPR) as a means to capture and characterize the generalized structural artifacts stemming from up-sampling operations. A comprehensive analysis is conducted on an open-world dataset, comprising samples generated by \tft{28 distinct generative models}. This analysis culminates in the establishment of a novel state-of-the-art performance, showcasing a remarkable \tft{12.8\%} improvement over existing methods. The code is available at //github.com/chuangchuangtan/NPR-DeepfakeDetection.

磁流變材料 · 系統設計 · 設計 · INFORMS · Performer ·

2023 年 12 月 14 日

Collaborative System Design of Mixed Reality Communication for Medical Training

Manuel Rebol,Krzysztof Pietroszek,Claudia Ranniger,Colton Hood,Adam Rutenberg,Neal Sikka,Christian Guetl

from arxiv, //hdl.handle.net/10125/102680

We present the design of a mixed reality (MR) telehealth training system that aims to close the gap between in-person and distance training and re-training for medical procedures. Our system uses real-time volumetric capture as a means for communicating and relating spatial information between the non-colocated trainee and instructor. The system's design is based on a requirements elicitation study performed in situ, at a medical school simulation training center. The focus is on the lightweight real-time transmission of volumetric data - meaning the use of consumer hardware, easy and quick deployment, and low-demand computations. We evaluate the MR system design by analyzing the workload for the users during medical training. We compare in-person, video, and MR training workloads. The results indicate that the overall workload for central line placement training with MR does not increase significantly compared to video communication. Our work shows that, when designed strategically together with domain experts, an MR communication system can be used effectively for complex medical procedural training without increasing the overall workload for users significantly. Moreover, MR systems offer new opportunities for teaching due to spatial information, hand tracking, and augmented communication.

估計/估計量 · 閾值 · 性能度量 · Performer · 似然 ·

2023 年 12 月 14 日

F1-EV Score: Measuring the Likelihood of Estimating a Good Decision Threshold for Semi-Supervised Anomaly Detection

Kevin Wilkinghoff,Keisuke Imoto

from arxiv, Accepted for presentation at IEEE ICASSP 2024

Anomalous sound detection (ASD) systems are usually compared by using threshold-independent performance measures such as AUC-ROC. However, for practical applications a decision threshold is needed to decide whether a given test sample is normal or anomalous. Estimating such a threshold is highly non-trivial in a semi-supervised setting where only normal training samples are available. In this work, F1-EV a novel threshold-independent performance measure for ASD systems that also includes the likelihood of estimating a good decision threshold is proposed and motivated using specific toy examples. In experimental evaluations, multiple performance measures are evaluated for all systems submitted to the ASD task of the DCASE Challenge 2023. It is shown that F1-EV is strongly correlated with AUC-ROC while having a significantly stronger correlation with the F1-score obtained with estimated and optimal decision thresholds than AUC-ROC.

圖像字幕 · CLIP · INFORMS · 推斷 · 模態 ·

2023 年 12 月 14 日

Improving Cross-modal Alignment with Synthetic Pairs for Text-only Image Captioning

Zhiyue Liu,Jinyuan Liu,Fanrong Ma

from arxiv, AAAI2024

Although image captioning models have made significant advancements in recent years, the majority of them heavily depend on high-quality datasets containing paired images and texts which are costly to acquire. Previous works leverage the CLIP's cross-modal association ability for image captioning, relying solely on textual information under unsupervised settings. However, not only does a modality gap exist between CLIP text and image features, but a discrepancy also arises between training and inference due to the unavailability of real-world images, which hinders the cross-modal alignment in text-only captioning. This paper proposes a novel method to address these issues by incorporating synthetic image-text pairs. A pre-trained text-to-image model is deployed to obtain images that correspond to textual data, and the pseudo features of generated images are optimized toward the real ones in the CLIP embedding space. Furthermore, textual information is gathered to represent image features, resulting in the image features with various semantics and the bridged modality gap. To unify training and inference, synthetic image features would serve as the training prefix for the language decoder, while real images are used for inference. Additionally, salient objects in images are detected as assistance to enhance the learning of modality alignment. Experimental results demonstrate that our method obtains the state-of-the-art performance on benchmark datasets.

MoDELS · Performer · Processing（編程語言） · 學成 · 穩健性 ·

2021 年 9 月 3 日

Learning Neural Models for Natural Language Processing in the Face of Distributional Shift

from arxiv, PhD thesis

The dominating NLP paradigm of training a strong neural predictor to perform one task on a specific dataset has led to state-of-the-art performance in a variety of applications (eg. sentiment classification, span-prediction based question answering or machine translation). However, it builds upon the assumption that the data distribution is stationary, ie. that the data is sampled from a fixed distribution both at training and test time. This way of training is inconsistent with how we as humans are able to learn from and operate within a constantly changing stream of information. Moreover, it is ill-adapted to real-world use cases where the data distribution is expected to shift over the course of a model's lifetime. The first goal of this thesis is to characterize the different forms this shift can take in the context of natural language processing, and propose benchmarks and evaluation metrics to measure its effect on current deep learning architectures. We then proceed to take steps to mitigate the effect of distributional shift on NLP models. To this end, we develop methods based on parametric reformulations of the distributionally robust optimization framework. Empirically, we demonstrate that these approaches yield more robust models as demonstrated on a selection of realistic problems. In the third and final part of this thesis, we explore ways of efficiently adapting existing models to new domains or tasks. Our contribution to this topic takes inspiration from information geometry to derive a new gradient update rule which alleviate catastrophic forgetting issues during adaptation.

生成模型 · MoDELS · 表示學習 · 學成 · 可辨認的 ·

2021 年 6 月 9 日

Generative Models as a Data Source for Multiview Representation Learning

Ali Jahanian,Xavier Puig,Yonglong Tian,Phillip Isola

Generative models are now capable of producing highly realistic images that look nearly indistinguishable from the data on which they are trained. This raises the question: if we have good enough generative models, do we still need datasets? We investigate this question in the setting of learning general-purpose visual representations from a black-box generative model rather than directly from data. Given an off-the-shelf image generator without any access to its training data, we train representations from the samples output by this generator. We compare several representation learning methods that can be applied to this setting, using the latent space of the generator to generate multiple "views" of the same semantic content. We show that for contrastive methods, this multiview data can naturally be used to identify positive pairs (nearby in latent space) and negative pairs (far apart in latent space). We find that the resulting representations rival those learned directly from real data, but that good performance requires care in the sampling strategy applied and the training method. Generative models can be viewed as a compressed and organized copy of a dataset, and we envision a future where more and more "model zoos" proliferate while datasets become increasingly unwieldy, missing, or private. This paper suggests several techniques for dealing with visual representation learning in such a future. Code is released on our project page: //ali-design.github.io/GenRep/

Vision · 模型評估 · 可約的 · 計算機視覺 · DNN ·

2020 年 3 月 24 日

A Survey of Methods for Low-Power Deep Learning and Computer Vision

Abhinav Goel,Caleb Tung,Yung-Hsiang Lu,George K. Thiruvathukal

from arxiv, Accepted for publication at 2020 IEEE 6th World Forum on Internet of Things (WF-IoT), New Orleans, LA, USA 2020

Deep neural networks (DNNs) are successful in many computer vision tasks. However, the most accurate DNNs require millions of parameters and operations, making them energy, computation and memory intensive. This impedes the deployment of large DNNs in low-power devices with limited compute resources. Recent research improves DNN models by reducing the memory requirement, energy consumption, and number of operations without significantly decreasing the accuracy. This paper surveys the progress of low-power deep learning and computer vision, specifically in regards to inference, and discusses the methods for compacting and accelerating DNN models. The techniques can be divided into four major categories: (1) parameter quantization and pruning, (2) compressed convolutional filters and matrix factorization, (3) network architecture search, and (4) knowledge distillation. We analyze the accuracy, advantages, disadvantages, and potential solutions to the problems with the techniques in each category. We also discuss new evaluation metrics as a guideline for future research.

圖片分類 · 生成式對抗網絡 · Networking · 未標記 · GANs ·

2018 年 2 月 10 日

Generative Adversarial Networks and Probabilistic Graph Models for Hyperspectral Image Classification

Zilong Zhong,Jonathan Li

from arxiv, Accepted by AAAI-18

High spectral dimensionality and the shortage of annotations make hyperspectral image (HSI) classification a challenging problem. Recent studies suggest that convolutional neural networks can learn discriminative spatial features, which play a paramount role in HSI interpretation. However, most of these methods ignore the distinctive spectral-spatial characteristic of hyperspectral data. In addition, a large amount of unlabeled data remains an unexploited gold mine for efficient data use. Therefore, we proposed an integration of generative adversarial networks (GANs) and probabilistic graphical models for HSI classification. Specifically, we used a spectral-spatial generator and a discriminator to identify land cover categories of hyperspectral cubes. Moreover, to take advantage of a large amount of unlabeled data, we adopted a conditional random field to refine the preliminary classification results generated by GANs. Experimental results obtained using two commonly studied datasets demonstrate that the proposed framework achieved encouraging classification accuracy using a small number of data for training.

閱讀: 0 點贊: 0

小貼士

登錄享

相關主題

state-of-the-art

北京阿比特科技有限公司

注冊地址：北京市海淀區羊坊店路18號2幢3層301-191

<tr id='stg72'><strong id='stg72'></strong><small id='stg72'></small><button id='stg72'></button><li id='stg72'><noscript id='stg72'><big id='stg72'></big><dt id='stg72'></dt></noscript></li></tr><ol id='stg72'><option id='stg72'><table id='stg72'><blockquote id='stg72'><tbody id='stg72'></tbody></blockquote></table></option></ol><u id='stg72'></u><kbd id='stg72'><kbd id='stg72'></kbd></kbd>

<code id='stg72'><strong id='stg72'></strong></code>

<fieldset id='stg72'></fieldset>

<span id='stg72'></span>

<ins id='stg72'></ins>

<acronym id='stg72'><em id='stg72'></em><td id='stg72'><div id='stg72'></div></td></acronym><address id='stg72'><big id='stg72'><big id='stg72'></big><legend id='stg72'></legend></big></address>

<i id='stg72'><div id='stg72'><ins id='stg72'></ins></div></i>

<i id='stg72'></i>