What happens when a machine learning dataset is deprecated for legal, ethical, or technical reasons, but continues to be widely used? In this paper, we examine the public afterlives of several prominent deprecated or redacted datasets, including ImageNet, 80 Million Tiny Images, MS-Celeb-1M, Duke MTMC, Brainwash, and HRT Transgender, in order to inform a framework for more consistent, ethical, and accountable dataset deprecation. Building on prior research, we find that there is a lack of consistency, transparency, and centralized sourcing of information on the deprecation of datasets, and as such, these datasets and their derivatives continue to be cited in papers and circulate online. These datasets that never die -- which we term "zombie datasets" -- continue to inform the design of production-level systems, causing technical, legal, and ethical challenges; in so doing, they risk perpetuating the harms that prompted their supposed withdrawal, including concerns around bias, discrimination, and privacy. Based on this analysis, we propose a Dataset Deprecation Framework that includes considerations of risk, mitigation of impact, appeal mechanisms, timeline, post-deprecation protocol, and publication checks that can be adapted and implemented by the machine learning community. Drawing on work on datasheets and checklists, we further offer two sample dataset deprecation sheets and propose a centralized repository that tracks which datasets have been deprecated and could be incorporated into the publication protocols of venues like NeurIPS.
Various tools and practices have been developed to support practitioners in identifying, assessing, and mitigating fairness-related harms caused by AI systems. However, prior research has highlighted gaps between the intended design of these tools and practices and their use within particular contexts, including gaps caused by the role that organizational factors play in shaping fairness work. In this paper, we investigate these gaps for one such practice: disaggregated evaluations of AI systems, intended to uncover performance disparities between demographic groups. By conducting semi-structured interviews and structured workshops with thirty-three AI practitioners from ten teams at three technology companies, we identify practitioners' processes, challenges, and needs for support when designing disaggregated evaluations. We find that practitioners face challenges when choosing performance metrics, identifying the most relevant direct stakeholders and demographic groups on which to focus, and collecting datasets with which to conduct disaggregated evaluations. More generally, we identify impacts on fairness work stemming from a lack of engagement with direct stakeholders or domain experts, business imperatives that prioritize customers over marginalized groups, and the drive to deploy AI systems at scale.
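To make the core practice concrete, the sketch below computes the same performance metric separately for each demographic group and reports the largest gap between groups, which is the essence of a disaggregated evaluation. It is an illustration only: the choice of accuracy as the metric, the group labels, and the array-based data layout are assumptions, not the tooling used by the interviewed teams.

import numpy as np

def disaggregated_accuracy(y_true, y_pred, groups):
    """Per-group accuracy plus the largest accuracy gap between groups."""
    y_true, y_pred, groups = map(np.asarray, (y_true, y_pred, groups))
    per_group = {}
    for g in np.unique(groups):
        mask = groups == g
        per_group[str(g)] = float(np.mean(y_true[mask] == y_pred[mask]))
    gap = max(per_group.values()) - min(per_group.values())
    return per_group, gap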
[Context] The COVID-19 pandemic has had a disruptive impact on how people work and collaborate across all global economic sectors, including the software business. While remote working is not new for software engineers, forced work-from-home situations come with constraints, limitations, and opportunities for individuals, software teams, and software companies. As the "new normal" for working might be based on the current state of Work From Home (WFH), it is useful to understand what has happened and learn from it. [Objective] The goal of this study is to gain insights into how the WFH environment impacts software projects and software companies. We are also interested in understanding whether the impact differs between software startups and established companies. [Method] We conducted a global-scale, cross-sectional survey during spring and summer 2021. Our results are based on quantitative and qualitative analysis of 297 valid responses. [Results] We observed a mixed perception of the impact of WFH on software project management, resilience, and innovation. Certain global patterns emerge in WFH practices, control and coordination mechanisms, and collaborative tools. We find that team, agility, and leadership are the three most important factors for achieving resilience during the pandemic. Although startups do not perceive the impact of WFH differently, there is a difference between engineers who work in a small-team context and those who work in a large-team context. [Conclusion] The results suggest a contingency approach to studying and improving WFH practices and environments in the future software industry.
In many real-world deployments of machine learning, we use a prediction algorithm to choose what data to test next. For example, in the protein design problem, we have a regression model that predicts some real-valued property of a protein sequence, which we use to propose new sequences believed to exhibit higher property values than observed in the training data. Since validating designed sequences in the wet lab is typically costly, it is important to know how much we can trust the model's predictions. In such settings, however, there is a distinct type of distribution shift between the training and test data: one where the training and test data are statistically dependent, as the latter is chosen based on the former. Consequently, the model's error on the test data -- that is, the designed sequences -- has some non-trivial relationship with its error on the training data. Herein, we introduce a method to quantify predictive uncertainty in such settings. We do so by constructing confidence sets for predictions that account for the dependence between the training and test data. The confidence sets we construct have finite-sample guarantees that hold for any prediction algorithm, even when a trained model chooses the test-time input distribution. As a motivating use case, we demonstrate how our method quantifies uncertainty in the predicted fitness of designed proteins using real data sets.
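For orientation, here is a minimal split-conformal sketch under a plain i.i.d. assumption, usable with any regressor that exposes a scikit-learn-style predict method. The paper's actual method goes further: it reweights the calibration data to account for the dependence introduced when test inputs are chosen using the trained model, which this baseline ignores.

import numpy as np

def split_conformal_interval(model, X_cal, y_cal, x_test, alpha=0.1):
    # Nonconformity scores on held-out calibration data (absolute residuals).
    scores = np.abs(np.asarray(y_cal) - model.predict(X_cal))
    n = len(scores)
    # Finite-sample conformal quantile: the ceil((n+1)(1-alpha))-th smallest score.
    k = min(int(np.ceil((n + 1) * (1 - alpha))) - 1, n - 1)
    q = np.sort(scores)[k]
    pred = float(model.predict(np.atleast_2d(x_test))[0])
    # Interval that covers the true label with prob >= 1 - alpha in the i.i.d. case.
    return pred - q, pred + q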
We present a novel hybrid strategy based on machine learning to improve curvature estimation in the level-set method. The proposed inference system couples enhanced neural networks with standard numerical schemes to compute curvature more accurately. The core of our hybrid framework is a switching mechanism that relies on well-established numerical techniques to gauge curvature. If the curvature magnitude is larger than a resolution-dependent threshold, the system instead uses a neural network to yield a better approximation. Our networks are multilayer perceptrons fitted to synthetic data sets composed of sinusoidal- and circular-interface samples at various configurations. To reduce data set size and training complexity, we leverage the problem's characteristic symmetry and build our models on just half of the curvature spectrum. These savings lead to a powerful inference system able to outperform either of its numerical or neural components alone. Experiments with stationary, smooth interfaces show that our hybrid solver is notably superior to conventional numerical methods on coarse grids and along steep interface regions. Compared to prior research, we have observed outstanding gains in precision after training the regression model with data pairs from more than a single interface type and transforming the data with specialized input preprocessing. In particular, our findings confirm that machine learning is a promising avenue for reducing or removing mass loss in the level-set method.
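The switching logic can be pictured with the sketch below: keep the numerical estimate where the interface is nearly flat, and hand high-curvature samples to the trained network, reflecting them into the half of the curvature spectrum the network was fitted on. The threshold constant, the stencil-based input format, and the MLP interface are illustrative assumptions, not the paper's exact implementation.

import numpy as np

def hybrid_curvature(kappa_numerical, stencil, mlp, h, c_switch=0.004):
    # Trust the standard numerical scheme in low-curvature regions.
    if abs(kappa_numerical) * h <= c_switch:   # resolution-dependent threshold (assumed form)
        return kappa_numerical
    # Exploit symmetry: the MLP was fitted on only half of the curvature
    # spectrum, so reflect the sample into that half and restore the sign.
    sign = 1.0 if kappa_numerical >= 0.0 else -1.0
    x = sign * np.asarray(stencil, dtype=float).reshape(1, -1)
    return sign * float(mlp.predict(x)[0])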
Federated learning (FL) is an emerging, privacy-preserving machine learning paradigm that has drawn tremendous attention in both academia and industry. A unique characteristic of FL is heterogeneity, which resides in the varying hardware specifications and dynamic states across the participating devices. In principle, heterogeneity can exert a huge influence on the FL training process, e.g., causing a device to be unavailable for training or unable to upload its model updates. Unfortunately, these impacts have never been systematically studied and quantified in the existing FL literature. In this paper, we carry out the first empirical study to characterize the impacts of heterogeneity in FL. We collect large-scale data from 136k smartphones that faithfully reflect heterogeneity in real-world settings. We also build a heterogeneity-aware FL platform that complies with the standard FL protocol while taking heterogeneity into account. Based on the data and the platform, we conduct extensive experiments to compare the performance of state-of-the-art FL algorithms under heterogeneity-aware and heterogeneity-unaware settings. Results show that heterogeneity causes non-trivial performance degradation in FL, including up to a 9.2% accuracy drop, 2.32x longer training time, and undermined fairness. Furthermore, we analyze potential impact factors and find that device failure and participant bias are two key factors behind the degradation. Our study provides insightful implications for FL practitioners. On the one hand, our findings suggest that FL algorithm designers should account for heterogeneity during evaluation. On the other hand, they urge system providers to design specific mechanisms to mitigate its impacts.
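As an illustration of what "heterogeneity-aware" means in practice, the sketch below simulates one FedAvg-style round in which devices may be offline, too slow for the round deadline, or fail to upload their update. The device fields and rates are hypothetical and do not reflect the platform's actual trace format.

import random

def heterogeneity_aware_round(global_model, devices, local_update, aggregate,
                              cohort_size=10, deadline_s=60.0):
    cohort = random.sample(devices, k=min(cohort_size, len(devices)))
    updates = []
    for dev in cohort:
        if random.random() > dev["online_prob"]:
            continue                                   # device unavailable for training
        train_time = dev["num_samples"] / dev["speed"]  # slower hardware trains longer
        if train_time > deadline_s or random.random() < dev["upload_fail_prob"]:
            continue                                   # dropped: too slow or failed to upload
        updates.append(local_update(global_model, dev["data"]))
    # Only the surviving updates are averaged, which is where participant bias enters.
    return aggregate(global_model, updates) if updates else global_model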
In recent years, misinformation on the Web has become increasingly rampant. The research community has responded by proposing systems and challenges, which are beginning to be useful for (various subtasks of) detecting misinformation. However, most proposed systems are based on deep learning techniques that are fine-tuned to specific domains, are difficult to interpret, and produce results that are not machine-readable. This limits their applicability and adoption, as they can only be used by a select expert audience in very specific settings. In this paper we propose an architecture based on a core concept of Credibility Reviews (CRs) that can be used to build networks of distributed bots that collaborate for misinformation detection. The CRs serve as building blocks to compose graphs of (i) web content, (ii) existing credibility signals -- fact-checked claims and reputation reviews of websites -- and (iii) automatically computed reviews. We implement this architecture on top of lightweight extensions to Schema.org and services providing generic NLP tasks for semantic similarity and stance detection. Evaluations on existing datasets of social-media posts, fake news, and political speeches demonstrate several advantages over existing systems: extensibility, domain independence, composability, explainability, and transparency via provenance. Furthermore, we obtain competitive results without requiring fine-tuning and establish a new state of the art on the Clef'18 CheckThat! Factuality task.
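A Credibility Review can be pictured as a Schema.org-style Review carrying provenance links to the reviews it composes. The snippet below is a hedged illustration only; any field beyond standard schema.org terms (e.g., the confidence value) is an assumption rather than the paper's actual vocabulary extension.

# A single Credibility Review (CR) for a social-media post, as JSON-LD-like data.
credibility_review = {
    "@context": "http://schema.org",
    "@type": "Review",
    "itemReviewed": {"@type": "SocialMediaPosting", "url": "https://example.org/post/123"},
    "reviewRating": {"@type": "Rating",
                     "ratingValue": -0.7,   # credibility score, e.g. in [-1, 1]
                     "confidence": 0.8},    # hypothetical confidence field
    "isBasedOn": [                          # provenance: the reviews this CR composes
        {"@type": "Review", "name": "stance of the post w.r.t. a fact-checked claim"},
        {"@type": "Review", "name": "reputation review of the linked website"},
    ],
    "author": {"@type": "SoftwareApplication", "name": "misinformation-detection bot"},
}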
In recent years, mobile devices have developed rapidly, gaining stronger computation capability and larger storage. Some computation-intensive machine learning and deep learning tasks can now be run on mobile devices. To take advantage of the resources available on mobile devices and preserve users' privacy, the idea of mobile distributed machine learning has been proposed. It uses local hardware resources and local data to solve machine learning sub-problems on mobile devices, and only uploads the computation results, instead of the original data, to contribute to the optimization of the global model. This architecture not only relieves the computation and storage burden on servers, but also protects users' sensitive information. Another benefit is reduced bandwidth usage, as various kinds of local data can now participate in the training process without being uploaded to the server. In this paper, we provide a comprehensive survey of recent studies on mobile distributed machine learning. We survey a number of widely used mobile distributed machine learning methods and present an in-depth discussion of the challenges and future directions in this area. We believe that this survey provides a clear overview of mobile distributed machine learning and offers guidelines for applying it to real applications.
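The core pattern surveyed here, solving a local sub-problem on-device and uploading only the result, can be sketched as follows for a simple least-squares model. The linear model, learning rate, and epoch count are illustrative choices, not a specific system from the surveyed literature.

import numpy as np

def local_update(w_global, X_local, y_local, lr=0.01, epochs=5):
    """Runs on the device; returns only a model delta, never the raw data."""
    w = w_global.copy()
    for _ in range(epochs):
        grad = X_local.T @ (X_local @ w - y_local) / len(y_local)  # least-squares gradient
        w -= lr * grad
    return w - w_global

def server_aggregate(w_global, deltas):
    """Runs on the server; combines the uploaded computation results."""
    return w_global + np.mean(deltas, axis=0)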
Transfer learning is one of the subjects undergoing intense study in the area of machine learning. In object recognition and object detection, the transferability of parameters has been studied experimentally, but not for neural networks suited to real-time object detection in embedded applications, such as the SqueezeDet neural network. We use transfer learning to accelerate the training of SqueezeDet on a new group of classes. We also conduct experiments to study the transferability and co-adaptation phenomena introduced by the transfer learning process. To accelerate training, we propose a new implementation of SqueezeDet training that provides a faster data-processing pipeline and achieves a $1.8$ times speedup compared to the initial implementation. Finally, we create a mechanism for automatic hyperparameter optimization using an empirical method.
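The transfer-learning setup described here follows the usual pattern of reusing pre-trained convolutional features and training a freshly initialized prediction head on the new classes. The sketch below shows that pattern in PyTorch with a placeholder backbone; it is not the actual SqueezeDet code, and the output-channel arithmetic is only indicative.

import torch
import torch.nn as nn

class TransferredDetector(nn.Module):
    def __init__(self, backbone: nn.Module, feat_channels: int, num_outputs: int):
        super().__init__()
        self.backbone = backbone
        for p in self.backbone.parameters():
            p.requires_grad = False          # freeze features learned on the source classes
        # Fresh 3x3 conv head, re-initialized for the new group of classes
        # (in a SqueezeDet-style head, num_outputs ~ anchors * (classes + conf + 4 box coords)).
        self.head = nn.Conv2d(feat_channels, num_outputs, kernel_size=3, padding=1)

    def forward(self, x):
        with torch.no_grad():
            feats = self.backbone(x)         # reuse the pre-trained feature extractor
        return self.head(feats)              # only the head receives gradient updates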
Despite the numerous developments in object tracking, further progress of current tracking algorithms is limited by small and mostly saturated datasets. As a matter of fact, data-hungry trackers based on deep learning currently rely on object detection datasets due to the scarcity of dedicated large-scale tracking datasets. In this work, we present TrackingNet, the first large-scale dataset and benchmark for object tracking in the wild. We provide more than 30K videos with more than 14 million dense bounding box annotations. Our dataset covers a wide selection of object classes in broad and diverse contexts. By releasing such a large-scale dataset, we expect deep trackers to further improve and generalize. In addition, we introduce a new benchmark composed of 500 novel videos, modeled with a distribution similar to our training dataset. By sequestering the annotations of the test set and providing an online evaluation server, we provide a fair benchmark for future development of object trackers. Deep trackers fine-tuned on a fraction of our dataset improve their performance by up to 1.6% on OTB100 and up to 1.7% on TrackingNet Test. We provide an extensive benchmark on TrackingNet by evaluating more than 20 trackers. Our results suggest that object tracking in the wild is far from being solved.
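For readers unfamiliar with how such trackers are scored, the sketch below computes the standard IoU-based success score used by OTB-style benchmarks; TrackingNet's online server reports related overlap and precision metrics, but the exact protocol details here are assumptions for illustration.

import numpy as np

def iou(a, b):
    """IoU of two axis-aligned boxes given as (x, y, w, h)."""
    iw = max(0.0, min(a[0] + a[2], b[0] + b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[1] + a[3], b[1] + b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = a[2] * a[3] + b[2] * b[3] - inter
    return inter / union if union > 0 else 0.0

def success_auc(pred_boxes, gt_boxes, thresholds=np.linspace(0, 1, 21)):
    """Fraction of frames whose IoU exceeds each threshold, averaged over thresholds."""
    ious = np.array([iou(p, g) for p, g in zip(pred_boxes, gt_boxes)])
    return float(np.mean([(ious > t).mean() for t in thresholds]))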
We describe a new training methodology for generative adversarial networks. The key idea is to grow both the generator and discriminator progressively: starting from a low resolution, we add new layers that model increasingly fine details as training progresses. This both speeds up training and greatly stabilizes it, allowing us to produce images of unprecedented quality, e.g., CelebA images at 1024^2. We also propose a simple way to increase the variation in generated images, achieving a record inception score of 8.80 on unsupervised CIFAR10. Additionally, we describe several implementation details that are important for discouraging unhealthy competition between the generator and discriminator. Finally, we suggest a new metric for evaluating GAN results, both in terms of image quality and variation. As an additional contribution, we construct a higher-quality version of the CelebA dataset.
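The progressive-growing step can be summarized by the fade-in rule sketched below: while a newly added higher-resolution block is being introduced, its output is linearly blended with the upsampled output of the previous resolution as alpha ramps from 0 to 1. The layer arguments are placeholders for the corresponding network components, not the paper's actual code.

def faded_generator_output(z, old_blocks, new_block, to_rgb_old, to_rgb_new,
                           upsample, alpha):
    feats = old_blocks(z)                          # features at the previous resolution
    low_res_img = upsample(to_rgb_old(feats))      # previous-resolution image, upsampled 2x
    high_res_img = to_rgb_new(new_block(feats))    # image produced through the new block
    # Smoothly fade the new block in: alpha ramps from 0 to 1 during training.
    return (1.0 - alpha) * low_res_img + alpha * high_res_img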