
Question Answering · Pivotal (company) · Performer · HTTPS · Language Modeling ·

October 20, 2023

Question Answering as Programming for Solving Time-Sensitive Questions

Xinyu Zhu,Cheng Yang,Bei Chen,Siheng Li,Jian-Guang Lou,Yujiu Yang

from arxiv, Accepted to EMNLP 2023 Main Conference

Question answering plays a pivotal role in human daily life because it involves our acquisition of knowledge about the world. However, due to the dynamic and ever-changing nature of real-world facts, the answer can be completely different when the time constraint in the question changes. Recently, Large Language Models (LLMs) have shown remarkable intelligence in question answering, while our experiments reveal that the aforementioned problems still pose a significant challenge to existing LLMs. This can be attributed to the LLMs' inability to perform rigorous reasoning based on surface-level text semantics. To overcome this limitation, rather than requiring LLMs to directly answer the question, we propose a novel approach where we reframe the $\textbf{Q}$uestion $\textbf{A}$nswering task $\textbf{a}$s $\textbf{P}$rogramming ($\textbf{QAaP}$). Concretely, by leveraging modern LLMs' superior capability in understanding both natural language and programming language, we endeavor to harness LLMs to represent diversely expressed text as well-structured code and select the best matching answer from multiple candidates through programming. We evaluate our QAaP framework on several time-sensitive question answering datasets and achieve decent improvement, up to $14.5$% over strong baselines. Our codes and data are available at //github.com/TianHongZXY/qaap
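The idea of answering a time-sensitive question "through programming" can be illustrated with a minimal sketch. This is not the authors' implementation: the `Fact` record, the query dictionary, the `best_answer` function, and the sample data are all hypothetical, and the step where an LLM turns free text into these structures is assumed to have already happened. The sketch only shows the final matching step, where the candidate whose time span best overlaps the question's time constraint is selected.

```python
from dataclasses import dataclass


@dataclass
class Fact:
    """One structured fact, as an LLM might extract it from text (hypothetical schema)."""
    subject: str
    relation: str
    obj: str
    start: int  # first year the fact holds
    end: int    # last year the fact holds


def overlap(a_start, a_end, b_start, b_end):
    """Number of years shared by two closed year intervals (0 if disjoint)."""
    return max(0, min(a_end, b_end) - max(a_start, b_start) + 1)


def best_answer(query, facts):
    """Return the object of the fact that best matches the query's time span.

    Facts with a different subject or relation, or with no temporal
    overlap at all, are ignored. Returns None when nothing matches.
    """
    best_obj, best_score = None, 0
    for fact in facts:
        if fact.subject != query["subject"] or fact.relation != query["relation"]:
            continue
        score = overlap(query["start"], query["end"], fact.start, fact.end)
        if score > best_score:
            best_obj, best_score = fact.obj, score
    return best_obj


# Made-up data for illustration only.
facts = [
    Fact("Alice", "works_for", "Acme", 2010, 2014),
    Fact("Alice", "works_for", "Globex", 2015, 2020),
]
query = {"subject": "Alice", "relation": "works_for", "start": 2016, "end": 2017}
print(best_answer(query, facts))  # Globex
```

Changing the query's time span to 2011 through 2012 would select "Acme" instead, which is the behavior the abstract describes: the same question yields a different answer when its time constraint changes.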

Related content

Question Answering

Question answering (QA) is the task of using computers to automatically answer questions posed by users in order to meet their knowledge needs. Unlike existing search engines, a QA system is an advanced form of information service: what it returns to the user is no longer a list of documents ranked by keyword matching, but a precise natural-language answer. In recent years, with the rapid development of artificial intelligence, question answering has become a closely watched research direction with broad prospects.


Identifiable · state-of-the-art · Learning · Automator · Guidance ·

December 7, 2023

Deep Learning for Hate Speech Detection: A Comparative Study

Jitendra Singh Malik,Hezhe Qiao,Guansong Pang,Anton van den Hengel

from arxiv, 18 pages, 4 figures, and 6 tables

Automated hate speech detection is an important tool in combating the spread of hate speech, particularly in social media. Numerous methods have been developed for the task, including a recent proliferation of deep-learning based approaches. A variety of datasets have also been developed, exemplifying various manifestations of the hate-speech detection problem. We present here a large-scale empirical comparison of deep and shallow hate-speech detection methods, mediated through the three most commonly used datasets. Our goal is to illuminate progress in the area, and identify strengths and weaknesses in the current state-of-the-art. We particularly focus our analysis on measures of practical performance, including detection accuracy, computational efficiency, capability in using pre-trained models, and domain generalization. In doing so we aim to provide guidance as to the use of hate-speech detection in practice, quantify the state-of-the-art, and identify future research directions. Code and dataset are available at //github.com/jmjmalik22/Hate-Speech-Detection.

Learning · Processing (programming language) · Neural Networks · Networks · Deep Learning ·

December 5, 2023

zkDL: Efficient Zero-Knowledge Proofs of Deep Learning Training

Haochen Sun,Tonghe Bai,Jason Li,Hongyang Zhang

from arxiv, 16 pages

The recent advancements in deep learning have brought about significant changes in various aspects of people's lives. Meanwhile, these rapid developments have raised concerns about the legitimacy of the training process of deep neural networks. To protect the intellectual properties of AI developers, directly examining the training process by accessing the model parameters and training data is often prohibited for verifiers. In response to this challenge, we present zero-knowledge deep learning (zkDL), an efficient zero-knowledge proof for deep learning training. To address the long-standing challenge of verifiable computations of non-linearities in deep learning training, we introduce zkReLU, a specialized proof for the ReLU activation and its backpropagation. zkReLU turns the disadvantage of non-arithmetic relations into an advantage, leading to the creation of FAC4DNN, our specialized arithmetic circuit design for modelling neural networks. This design aggregates the proofs over different layers and training steps, without being constrained by their sequential order in the training process. With our new CUDA implementation that achieves full compatibility with the tensor structures and the aggregated proof design, zkDL enables the generation of complete and sound proofs in less than a second per batch update for an 8-layer neural network with 10M parameters and a batch size of 64, while provably ensuring the privacy of data and model parameters. To our best knowledge, we are not aware of any existing work on zero-knowledge proof of deep learning training that is scalable to million-size networks.

Performer · Extensibility · state-of-the-art · LIDAR · HTTPS ·

December 5, 2023

NeuRAD: Neural Rendering for Autonomous Driving

Adam Tonderski,Carl Lindström,Georg Hess,William Ljungbergh,Lennart Svensson,Christoffer Petersson

Neural radiance fields (NeRFs) have gained popularity in the autonomous driving (AD) community. Recent methods show NeRFs' potential for closed-loop simulation, enabling testing of AD systems, and as an advanced training data augmentation technique. However, existing methods often require long training times, dense semantic supervision, or lack generalizability. This, in turn, hinders the application of NeRFs for AD at scale. In this paper, we propose NeuRAD, a robust novel view synthesis method tailored to dynamic AD data. Our method features simple network design, extensive sensor modeling for both camera and lidar -- including rolling shutter, beam divergence and ray dropping -- and is applicable to multiple datasets out of the box. We verify its performance on five popular AD datasets, achieving state-of-the-art performance across the board. To encourage further development, we will openly release the NeuRAD source code. See //github.com/georghess/NeuRAD .

motivation · TOOLS · Generative AI · Large Language Models · Reducible ·

December 5, 2023

Tweetorial Hooks: Generative AI Tools to Motivate Science on Social Media

Tao Long,Dorothy Zhang,Grace Li,Batool Taraif,Samia Menon,Kynnedy Simone Smith,Sitong Wang,Katy Ilonka Gero,Lydia B. Chilton

from arxiv, 10 pages, 10 figures. Proceedings of the 14th International Conference on Computational Creativity (ICCC'23)

Communicating science and technology is essential for the public to understand and engage in a rapidly changing world. Tweetorials are an emerging phenomenon where experts explain STEM topics on social media in creative and engaging ways. However, STEM experts struggle to write an engaging "hook" in the first tweet that captures the reader's attention. We propose methods to use large language models (LLMs) to help users scaffold their process of writing a relatable hook for complex scientific topics. We demonstrate that LLMs can help writers find everyday experiences that are relatable and interesting to the public, avoid jargon, and spark curiosity. Our evaluation shows that the system reduces cognitive load and helps people write better hooks. Lastly, we discuss the importance of interactivity with LLMs to preserve the correctness, effectiveness, and authenticity of the writing.

SLAM · Sparse · state-of-the-art · Sensors · Graph ·

December 4, 2023

Efficient 2D Graph SLAM for Sparse Sensing

Hanzhi Zhou,Zichao Hu,Sihang Liu,Samira Khan

from arxiv, Accepted for 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

Simultaneous localization and mapping (SLAM) plays a vital role in mapping unknown spaces and aiding autonomous navigation. Virtually all state-of-the-art solutions today for 2D SLAM are designed for dense and accurate sensors such as laser range-finders (LiDARs). However, these sensors are not suitable for resource-limited nano robots, which become increasingly capable and ubiquitous nowadays, and these robots tend to mount economical and low-power sensors that can only provide sparse and noisy measurements. This introduces a challenging problem called SLAM with sparse sensing. This work addresses the problem by adopting the form of the state-of-the-art graph-based SLAM pipeline with a novel frontend and an improvement for loop closing in the backend, both of which are designed to work with sparse and uncertain range data. Experiments show that the maps constructed by our algorithm have superior quality compared to prior works on sparse sensing. Furthermore, our method is capable of running in real-time on a modern PC with an average processing time of 1/100th the input interval time.

contrastive · Contrastive Learning · Learned · Extensibility · Sparse ·

March 25, 2022

Versatile Multi-Modal Pre-Training for Human-Centric Perception

Fangzhou Hong,Liang Pan,Zhongang Cai,Ziwei Liu

from arxiv, CVPR 2022; Project Page //hongfz16.github.io/projects/HCMoCo.html; Codes available at //github.com/hongfz16/HCMoCo

Human-centric perception plays a vital role in vision and graphics. However, the data annotations it requires are prohibitively expensive. Therefore, it is desirable to have a versatile pre-trained model that serves as a foundation for data-efficient transfer to downstream tasks. To this end, we propose the Human-Centric Multi-Modal Contrastive Learning framework HCMoCo that leverages the multi-modal nature of human data (e.g. RGB, depth, 2D keypoints) for effective representation learning. The objective comes with two main challenges: dense pre-training for multi-modality data and efficient usage of sparse human priors. To tackle the challenges, we design the novel Dense Intra-sample Contrastive Learning and Sparse Structure-aware Contrastive Learning targets by hierarchically learning a modal-invariant latent space featured with continuous and ordinal feature distribution and structure-aware semantic consistency. HCMoCo provides pre-training for different modalities by combining heterogeneous datasets, which allows efficient usage of existing task-specific human data. Extensive experiments on four downstream tasks of different modalities demonstrate the effectiveness of HCMoCo, especially under data-efficient settings (7.16% and 12% improvement on DensePose Estimation and Human Parsing). Moreover, we demonstrate the versatility of HCMoCo by exploring cross-modality supervision and missing-modality inference, validating its strong ability in cross-modal association and reasoning.

Recommender Systems · Learned · Reinforcement Learning · Policy Search · INTERACT ·

September 22, 2021

A Survey on Reinforcement Learning for Recommender Systems

Yuanguo Lin,Yong Liu,Fan Lin,Pengcheng Wu,Wenhua Zeng,Chunyan Miao

from arxiv, 25 pages, 4 figures

Recommender systems have been widely applied in different real-life scenarios to help us find useful information. Recently, Reinforcement Learning (RL) based recommender systems have become an emerging research topic. They often surpass traditional recommendation models and even most deep learning-based methods, owing to their interactive nature and autonomous learning ability. Nevertheless, applying RL in recommender systems raises various challenges. To this end, we first provide a thorough overview, comparison, and summary of RL approaches for five typical recommendation scenarios, following three main categories of RL: value-function, policy search, and Actor-Critic. Then, we systematically analyze the challenges and relevant solutions on the basis of existing literature. Finally, in discussing the open issues of RL and its limitations for recommendation, we highlight some potential research directions in this field.

Image Captioning · state-of-the-art · Vision · Identifiable · Language Modeling ·

July 14, 2021

From Show to Tell: A Survey on Image Captioning

Matteo Stefanini,Marcella Cornia,Lorenzo Baraldi,Silvia Cascianelli,Giuseppe Fiameni,Rita Cucchiara

Connecting Vision and Language plays an essential role in Generative Intelligence. For this reason, in the last few years, a large research effort has been devoted to image captioning, i.e. the task of describing images with syntactically and semantically meaningful sentences. Starting from 2015 the task has generally been addressed with pipelines composed of a visual encoding step and a language model for text generation. During these years, both components have evolved considerably through the exploitation of object regions, attributes, and relationships and the introduction of multi-modal connections, fully-attentive approaches, and BERT-like early-fusion strategies. However, regardless of the impressive results obtained, research in image captioning has not reached a conclusive answer yet. This work aims at providing a comprehensive overview and categorization of image captioning approaches, from visual encoding and text generation to training strategies, used datasets, and evaluation metrics. In this respect, we quantitatively compare many relevant state-of-the-art approaches to identify the most impactful technical innovations in image captioning architectures and training strategies. Moreover, many variants of the problem and its open challenges are analyzed and discussed. The final goal of this work is to serve as a tool for understanding the existing state-of-the-art and highlighting the future directions for an area of research where Computer Vision and Natural Language Processing can find an optimal synergy.

Masking · BERT · MoDELS · Masked Language Modeling · Extensibility ·

June 19, 2019

Pre-Training with Whole Word Masking for Chinese BERT

Yiming Cui,Wanxiang Che,Ting Liu,Bing Qin,Ziqing Yang,Shijin Wang,Guoping Hu

from arxiv, 10 pages

Bidirectional Encoder Representations from Transformers (BERT) has shown marvelous improvements across various NLP tasks. Recently, an upgraded version of BERT has been released with Whole Word Masking (WWM), which mitigates the drawbacks of masking partial WordPiece tokens in pre-training BERT. In this technical report, we adapt whole word masking to Chinese text, that is, masking the whole word instead of individual Chinese characters, which could bring another challenge to the Masked Language Model (MLM) pre-training task. The model was trained on the latest Chinese Wikipedia dump. We aim to provide easy extensibility and better performance for Chinese BERT without changing any neural architecture or even hyper-parameters. The model is verified on various NLP tasks, from sentence level to document level, including sentiment classification (ChnSentiCorp, Sina Weibo), named entity recognition (People Daily, MSRA-NER), natural language inference (XNLI), sentence pair matching (LCQMC, BQ Corpus), and machine reading comprehension (CMRC 2018, DRCD, CAIL RC). Experimental results on these datasets show that whole word masking could bring another significant gain. Moreover, we also examine the effectiveness of Chinese pre-trained models: BERT, ERNIE, BERT-wwm. We release the pre-trained model (both TensorFlow and PyTorch) on GitHub: //github.com/ymcui/Chinese-BERT-wwm
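The difference between character-level masking and whole word masking for Chinese can be shown with a small sketch. It is an illustration under stated assumptions, not the released pre-training code: the function name `whole_word_mask` is hypothetical, the text is assumed to be already segmented into words, and the words to mask are passed in explicitly, whereas a real pipeline would run a word segmenter and sample them at random.

```python
def whole_word_mask(words, mask_word_indices, mask_token="[MASK]"):
    """Mask every character of each selected word.

    words: list of already segmented Chinese words (segmentation is assumed).
    mask_word_indices: set of word positions to mask (chosen by the caller here;
    a real pre-training pipeline would sample them randomly).
    Returns the character-level token list fed to the model.
    """
    tokens = []
    for i, word in enumerate(words):
        for ch in word:
            # Chinese BERT tokenizes per character, so a word spans several tokens.
            tokens.append(mask_token if i in mask_word_indices else ch)
    return tokens


words = ["使用", "语言", "模型"]  # "use", "language", "model"
print(whole_word_mask(words, {1}))
# ['使', '用', '[MASK]', '[MASK]', '模', '型']
```

With character-level masking, only one of the two characters in "语言" might be hidden, leaving the other as a strong hint. Masking the whole word removes that hint, which is why the abstract describes it as a harder MLM task.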

Performance · Similarity Measure · Performer · state-of-the-art · Image Retrieval ·

April 6, 2018

Cross-Domain Image Matching with Deep Feature Maps

Bailey Kong,James Supancic,Deva Ramanan,Charless C. Fowlkes

We investigate the problem of automatically determining what type of shoe left an impression found at a crime scene. This recognition problem is made difficult by the variability in types of crime scene evidence (ranging from traces of dust or oil on hard surfaces to impressions made in soil) and the lack of comprehensive databases of shoe outsole tread patterns. We find that mid-level features extracted by pre-trained convolutional neural nets are surprisingly effective descriptors for this specialized domain. However, the choice of similarity measure for matching exemplars to a query image is essential to good performance. For matching multi-channel deep features, we propose the use of multi-channel normalized cross-correlation and analyze its effectiveness. Our proposed metric significantly improves performance in matching crime scene shoeprints to laboratory test impressions. We also show its effectiveness in other cross-domain image retrieval problems: matching facade images to segmentation labels and aerial photos to map images. Finally, we introduce a discriminatively trained variant and fine-tune our system through our proposed metric, obtaining state-of-the-art performance.
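A rough sketch of a multi-channel normalized cross-correlation score is given below. It is a simplified reading of the idea, not the paper's exact formulation: it assumes the two feature maps are already aligned and the same size (no sliding window), flattens each channel to a list, and averages the per-channel scores. The function names `ncc` and `mcncc` and the `eps` stabilizer are assumptions made for this example.

```python
import math


def ncc(x, y, eps=1e-8):
    """Normalized cross-correlation of two equal-length channels.

    Each channel is centered by its own mean and scaled by its own norm,
    so the score is insensitive to per-channel offset and gain.
    """
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    dx = [a - mean_x for a in x]
    dy = [b - mean_y for b in y]
    num = sum(a * b for a, b in zip(dx, dy))
    den = math.sqrt(sum(a * a for a in dx) * sum(b * b for b in dy))
    return num / (den + eps)  # eps guards against constant channels


def mcncc(f, g):
    """Average the per-channel NCC over all channels of two feature maps.

    f, g: lists of channels, each channel a flat list of activations.
    """
    return sum(ncc(a, b) for a, b in zip(f, g)) / len(f)


# Toy two-channel features (made-up numbers).
query = [[1.0, 2.0, 3.0, 4.0], [0.0, 1.0, 0.0, 1.0]]
shifted = [[v * 2.0 + 5.0 for v in ch] for ch in query]  # per-channel gain and offset
print(round(mcncc(query, shifted), 6))  # 1.0
```

Normalizing each channel separately is the design point worth noting: a channel with large raw activations cannot dominate the score, which a plain dot product over all channels would allow.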


