亚洲男人的天堂2018av,欧美草比,久久久久久免费视频精选,国色天香在线看免费,久久久久亚洲av成人片仓井空

<tfoot id='apeb2'></tfoot>

<legend id='apeb2'><style id='apeb2'><dir id='apeb2'><q id='apeb2'></q></dir></style></legend>

<i id='apeb2'><tr id='apeb2'><dt id='apeb2'><q id='apeb2'><span id='apeb2'><b id='apeb2'><form id='apeb2'><ins id='apeb2'></ins><ul id='apeb2'></ul><sub id='apeb2'></sub></form><legend id='apeb2'></legend><bdo id='apeb2'><pre id='apeb2'><center id='apeb2'></center></pre></bdo></b><th id='apeb2'></th></span></q></dt></tr></i><div id='apeb2'><tfoot id='apeb2'></tfoot><dl id='apeb2'><fieldset id='apeb2'></fieldset></dl></div>

<li id='apeb2'><abbr id='apeb2'></abbr></li>

·

Performer · 可約的 · MoDELS · AI · Machine Learning ·

2021 年 12 月 29 日

MedPerf: Open Benchmarking Platform for Medical Artificial Intelligence using Federated Evaluation

Alexandros Karargyris,Renato Umeton,Micah J. Sheller,Alejandro Aristizabal,Johnu George,Srini Bala,Daniel J. Beutel,Victor Bittorf,Akshay Chaudhari,Alexander Chowdhury,Cody Coleman,Bala Desinghu,Gregory Diamos,Debo Dutta,Diane Feddema,Grigori Fursin,Junyi Guo,Xinyuan Huang,David Kanter,Satyananda Kashyap,Nicholas Lane,Indranil Mallick,Pietro Mascagni,Virendra Mehta,Vivek Natarajan,Nikola Nikolov,Nicolas Padoy,Gennady Pekhimenko,Vijay Janapa Reddi,G Anthony Reina,Pablo Ribalta,Jacob Rosenthal,Abhishek Singh,Jayaraman J. Thiagarajan,Anna Wuest,Maria Xenochristou,Daguang Xu,Poonam Yadav,Michael Rosenthal,Massimo Loda,Jason M. Johnson,Peter Mattson

Medical AI has tremendous potential to advance healthcare by supporting the evidence-based practice of medicine, personalizing patient treatment, reducing costs, and improving provider and patient experience. We argue that unlocking this potential requires a systematic way to measure the performance of medical AI models on large-scale heterogeneous data. To meet this need, we are building MedPerf, an open framework for benchmarking machine learning in the medical domain. MedPerf will enable federated evaluation in which models are securely distributed to different facilities for evaluation, thereby empowering healthcare organizations to assess and verify the performance of AI models in an efficient and human-supervised process, while prioritizing privacy. We describe the current challenges healthcare and AI communities face, the need for an open platform, the design philosophy of MedPerf, its current implementation status, and our roadmap. We call for researchers and organizations to join us in creating the MedPerf open benchmarking platform.

相關內容

Performer

任務對話系統 · 數據集 · 可理解性 · 相互獨立的 · Machine Learning ·

2022 年 4 月 19 日

A Benchmark for Automatic Medical Consultation System: Frameworks, Tasks and Datasets

Wei Chen,Zhiwei Li,Hongyi Fang,Qianyuan Yao,Cheng Zhong,Jianye Hao,Qi Zhang,Xuanjing Huang,J iajie Peng,Zhongyu Wei

from arxiv, 8 pages, 5 figures, 10 tables

In recent years, interest has arisen in using machine learning to improve the efficiency of automatic medical consultation and enhance patient experience. In this paper, we propose two frameworks to support automatic medical consultation, namely doctor-patient dialogue understanding and task-oriented interaction. A new large medical dialogue dataset with multi-level fine-grained annotations is introduced and five independent tasks are established, including named entity recognition, dialogue act classification, symptom label inference, medical report generation and diagnosis-oriented dialogue policy. We report a set of benchmark results for each task, which shows the usability of the dataset and sets a baseline for future studies.

聯邦學習 · 學成 · 模型評估 · 講稿 · MoDELS ·

2022 年 4 月 19 日

Deep Federated Learning for Autonomous Driving

Anh Nguyen,Tuong Do,Minh Tran,Binh X. Nguyen,Chien Duong,Tu Phan,Erman Tjiputra,Quang D. Tran

from arxiv, Accepted in IEEE Intelligent Vehicles Symposium 2022 (IV 2022)

Autonomous driving is an active research topic in both academia and industry. However, most of the existing solutions focus on improving the accuracy by training learnable models with centralized large-scale data. Therefore, these methods do not take into account the user's privacy. In this paper, we present a new approach to learn autonomous driving policy while respecting privacy concerns. We propose a peer-to-peer Deep Federated Learning (DFL) approach to train deep architectures in a fully decentralized manner and remove the need for central orchestration. We design a new Federated Autonomous Driving network (FADNet) that can improve the model stability, ensure convergence, and handle imbalanced data distribution problems while is being trained with federated learning methods. Intensively experimental results on three datasets show that our approach with FADNet and DFL achieves superior accuracy compared with other recent methods. Furthermore, our approach can maintain privacy by not collecting user data to a central server.

Performer · 控制器 · 前向 · 可辨認的 · AI ·

2022 年 4 月 19 日

When Cyber-Physical Systems Meet AI: A Benchmark, an Evaluation, and a Way Forward

Jiayang Song,Deyun Lyu,Zhenya Zhang,Zhijie Wang,Tianyi Zhang,Lei Ma

Cyber-physical systems (CPS) have been broadly deployed in safety-critical domains, such as automotive systems, avionics, medical devices, etc. In recent years, Artificial Intelligence (AI) has been increasingly adopted to control CPS. Despite the popularity of AI-enabled CPS, few benchmarks are publicly available. There is also a lack of deep understanding on the performance and reliability of AI-enabled CPS across different industrial domains. To bridge this gap, we initiate to create a public benchmark of industry-level CPS in seven domains and build AI controllers for them via state-of-the-art deep reinforcement learning (DRL) methods. Based on that, we further perform a systematic evaluation of these AI-enabled systems with their traditional counterparts to identify the current challenges and explore future opportunities. Our key findings include (1) AI controllers do not always outperform traditional controllers, (2) existing CPS testing techniques (falsification, specifically) fall short of analyzing AI-enabled CPS, and (3) building a hybrid system that strategically combines and switches between AI controllers and traditional controllers can achieve better performance across different domains. Our results highlight the need for new testing techniques for AI-enabled CPS and the need for more investigations into hybrid CPS systems to achieve optimal performance and reliability.

潛變量/隱變量 · 生成模型 · MoDELS · Performer · 學成 ·

2022 年 4 月 18 日

Training and Evaluation of Deep Policies using Reinforcement Learning and Generative Models

Ali Ghadirzadeh,Petra Poklukar,Karol Arndt,Chelsea Finn,Ville Kyrki,Danica Kragic,M?rten Bj?rkman

from arxiv, arXiv admin note: substantial text overlap with arXiv:2007.13134

We present a data-efficient framework for solving sequential decision-making problems which exploits the combination of reinforcement learning (RL) and latent variable generative models. The framework, called GenRL, trains deep policies by introducing an action latent variable such that the feed-forward policy search can be divided into two parts: (i) training a sub-policy that outputs a distribution over the action latent variable given a state of the system, and (ii) unsupervised training of a generative model that outputs a sequence of motor actions conditioned on the latent action variable. GenRL enables safe exploration and alleviates the data-inefficiency problem as it exploits prior knowledge about valid sequences of motor actions. Moreover, we provide a set of measures for evaluation of generative models such that we are able to predict the performance of the RL policy training prior to the actual training on a physical robot. We experimentally determine the characteristics of generative models that have most influence on the performance of the final policy training on two robotics tasks: shooting a hockey puck and throwing a basketball. Furthermore, we empirically demonstrate that GenRL is the only method which can safely and efficiently solve the robotics tasks compared to two state-of-the-art RL methods.

Extensibility · Performer · 講稿 · 數據集 · Performance ·

2022 年 4 月 18 日

LwHBench: A low-level hardware component benchmark and dataset for Single Board Computers

Pedro Miguel Sánchez Sánchez,José María Jorquera Valero,Alberto Huertas Celdrán,Gér?me Bovet,Manuel Gil Pérez,Gregorio Martínez Pérez

In today's computing environment, where Artificial Intelligence (AI) and data processing are moving toward the Internet of Things (IoT) and the Edge computing paradigm, benchmarking resource-constrained devices is a critical task to evaluate their suitability and performance. The literature has extensively explored the performance of IoT devices when running high-level benchmarks specialized in particular application scenarios, such as AI or medical applications. However, lower-level benchmarking applications and datasets that analyze the hardware components of each device are needed. This low-level device understanding enables new AI solutions for network, system and service management based on device performance, such as individual device identification, so it is an area worth exploring more in detail. In this paper, we present LwHBench, a low-level hardware benchmarking application for Single-Board Computers that measures the performance of CPU, GPU, Memory and Storage taking into account the component constraints in these types of devices. LwHBench has been implemented for Raspberry Pi devices and run for 100 days on a set of 45 devices to generate an extensive dataset that allows the usage of AI techniques in different application scenarios. Finally, to demonstrate the inter-scenario capability of the created dataset, a series of AI-enabled use cases about device identification and context impact on performance are presented as examples and exploration of the published data.

聯邦學習 · 學成 · Networking · 可約的 · 講稿 ·

2022 年 4 月 18 日

A Practical Cross-Device Federated Learning Framework over 5G Networks

Wenti Yang,Naiyu Wang,Zhitao Guan,Longfei Wu,Xiaojiang Du,Mohsen Guizani

from arxiv, This paper has been accepted by IEEE Wireless Communications

The concept of federated learning (FL) was first proposed by Google in 2016. Thereafter, FL has been widely studied for the feasibility of application in various fields due to its potential to make full use of data without compromising the privacy. However, limited by the capacity of wireless data transmission, the employment of federated learning on mobile devices has been making slow progress in practical. The development and commercialization of the 5th generation (5G) mobile networks has shed some light on this. In this paper, we analyze the challenges of existing federated learning schemes for mobile devices and propose a novel cross-device federated learning framework, which utilizes the anonymous communication technology and ring signature to protect the privacy of participants while reducing the computation overhead of mobile devices participating in FL. In addition, our scheme implements a contribution-based incentive mechanism to encourage mobile users to participate in FL. We also give a case study of autonomous driving. Finally, we present the performance evaluation of the proposed scheme and discuss some open issues in federated learning.

多峰值 · 可理解性 · 講稿 · INTERACT · 數據集 ·

2022 年 4 月 17 日

MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration

Thomas Hayes,Songyang Zhang,Xi Yin,Guan Pang,Sasha Sheng,Harry Yang,Songwei Ge,Isabelle Hu,Devi Parikh

Multimodal video-audio-text understanding and generation can benefit from datasets that are narrow but rich. The narrowness allows bite-sized challenges that the research community can make progress on. The richness ensures we are making progress along the core challenges. To this end, we present a large-scale video-audio-text dataset MUGEN, collected using the open-sourced platform game CoinRun [11]. We made substantial modifications to make the game richer by introducing audio and enabling new interactions. We trained RL agents with different objectives to navigate the game and interact with 13 objects and characters. This allows us to automatically extract a large collection of diverse videos and associated audio. We sample 375K video clips (3.2s each) and collect text descriptions from human annotators. Each video has additional annotations that are extracted automatically from the game engine, such as accurate semantic maps for each frame and templated textual descriptions. Altogether, MUGEN can help progress research in many tasks in multimodal understanding and generation. We benchmark representative approaches on tasks involving video-audio-text retrieval and generation. Our dataset and code are released at: //mugen-org.github.io/.

MoDELS · 語言模型化 · CASE · AIM · 表示 ·

2022 年 4 月 15 日

Evaluation Benchmarks for Spanish Sentence Representations

Vladimir Araujo,Andrés Carvallo,Souvik Kundu,José Ca?ete,Marcelo Mendoza,Robert E. Mercer,Felipe Bravo-Marquez,Marie-Francine Moens,Alvaro Soto

from arxiv, Accepted paper at LREC2022

Due to the success of pre-trained language models, versions of languages other than English have been released in recent years. This fact implies the need for resources to evaluate these models. In the case of Spanish, there are few ways to systematically assess the models' quality. In this paper, we narrow the gap by building two evaluation benchmarks. Inspired by previous work (Conneau and Kiela, 2018; Chen et al., 2019), we introduce Spanish SentEval and Spanish DiscoEval, aiming to assess the capabilities of stand-alone and discourse-aware sentence representations, respectively. Our benchmarks include considerable pre-existing and newly constructed datasets that address different tasks from various domains. In addition, we evaluate and analyze the most recent pre-trained Spanish language models to exhibit their capabilities and limitations. As an example, we discover that for the case of discourse evaluation tasks, mBERT, a language model trained on multiple languages, usually provides a richer latent representation than models trained only with documents in Spanish. We hope our contribution will motivate a fairer, more comparable, and less cumbersome way to evaluate future Spanish language models.

Machine Learning · 學成 · ML · 機器學習模型 · domain shift ·

2022 年 4 月 15 日

Rethinking Machine Learning Model Evaluation in Pathology

Syed Ashar Javed,Dinkar Juyal,Zahil Shanis,Shreya Chakraborty,Harsha Pokkalla,Aaditya Prakash

from arxiv, ICLR 2022 ML Evaluation Workshop

Machine Learning has been applied to pathology images in research and clinical practice with promising outcomes. However, standard ML models often lack the rigorous evaluation required for clinical decisions. Machine learning techniques for natural images are ill-equipped to deal with pathology images that are significantly large and noisy, require expensive labeling, are hard to interpret, and are susceptible to spurious correlations. We propose a set of practical guidelines for ML evaluation in pathology that address the above concerns. The paper includes measures for setting up the evaluation framework, effectively dealing with variability in labels, and a recommended suite of tests to address issues related to domain shift, robustness, and confounding variables. We hope that the proposed framework will bridge the gap between ML researchers and domain experts, leading to wider adoption of ML techniques in pathology and improving patient outcomes.

學成 · 泛化理論 · AIM · state-of-the-art · 強化學習 ·

2019 年 10 月 24 日

Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning

Tianhe Yu,Deirdre Quillen,Zhanpeng He,Ryan Julian,Karol Hausman,Chelsea Finn,Sergey Levine

from arxiv, CoRL 2019. Videos are here: meta-world.github.io and open-sourced codes are available at: //github.com/rlworkgroup/metaworld

Meta-reinforcement learning algorithms can enable robots to acquire new skills much more quickly, by leveraging prior experience to learn how to learn. However, much of the current research on meta-reinforcement learning focuses on task distributions that are very narrow. For example, a commonly used meta-reinforcement learning benchmark uses different running velocities for a simulated robot as different tasks. When policies are meta-trained on such narrow task distributions, they cannot possibly generalize to more quickly acquire entirely new tasks. Therefore, if the aim of these methods is to enable faster acquisition of entirely new behaviors, we must evaluate them on task distributions that are sufficiently broad to enable generalization to new behaviors. In this paper, we propose an open-source simulated benchmark for meta-reinforcement learning and multi-task learning consisting of 50 distinct robotic manipulation tasks. Our aim is to make it possible to develop algorithms that generalize to accelerate the acquisition of entirely new, held-out tasks. We evaluate 6 state-of-the-art meta-reinforcement learning and multi-task learning algorithms on these tasks. Surprisingly, while each task and its variations (e.g., with different object positions) can be learned with reasonable success, these algorithms struggle to learn with multiple tasks at the same time, even with as few as ten distinct training tasks. Our analysis and open-source environments pave the way for future research in multi-task learning and meta-learning that can enable meaningful generalization, thereby unlocking the full potential of these methods.

閱讀: 0 點贊: 0

小貼士

登錄享

相關主題

Machine Learning

北京阿比特科技有限公司

注冊地址：北京市海淀區羊坊店路18號2幢3層301-191

<tfoot id='apeb2'></tfoot>

<legend id='apeb2'><style id='apeb2'><dir id='apeb2'><q id='apeb2'></q></dir></style></legend>

<i id='apeb2'><tr id='apeb2'><dt id='apeb2'><q id='apeb2'><span id='apeb2'><b id='apeb2'><form id='apeb2'><ins id='apeb2'></ins><ul id='apeb2'></ul><sub id='apeb2'></sub></form><legend id='apeb2'></legend><bdo id='apeb2'><pre id='apeb2'><center id='apeb2'></center></pre></bdo></b><th id='apeb2'></th></span></q></dt></tr></i><div id='apeb2'><tfoot id='apeb2'></tfoot><dl id='apeb2'><fieldset id='apeb2'></fieldset></dl></div>