Message passing Graph Neural Networks (GNNs) are known to be limited in expressive power by the 1-WL color-refinement test for graph isomorphism. Other, more expressive models are either computationally expensive or require preprocessing to extract structural features from the graph. In this work, we propose to make GNNs universal by guiding the learning process with exact isomorphism-solver techniques, which operate on the paradigm of Individualization and Refinement (IR), a method to artificially introduce asymmetry and further refine the coloring when 1-WL stops. Isomorphism solvers generate a search tree of colorings whose leaves uniquely identify the graph. However, the tree grows exponentially large and requires hand-crafted pruning techniques, which are not desirable from a learning perspective. We take a probabilistic view and approximate the search tree of colorings (i.e., embeddings) by sampling multiple paths from the root to the leaves of the search tree. To learn more discriminative representations, we guide the sampling process with particle filter updates, a principled approach for sequential state estimation. Our algorithm is end-to-end differentiable, can be applied with any GNN as a backbone, and learns richer graph representations with only a linear increase in runtime. Experimental evaluation shows that our approach consistently outperforms leading GNN models on both synthetic benchmarks for isomorphism detection and real-world datasets.
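As a rough illustration of the individualization-and-refinement loop this abstract builds on, here is a minimal sketch in plain Python; the cell-selection rule, color re-indexing, and the slot where particle-filter weighting would enter are assumptions for illustration, not the paper's exact procedure.

```python
import random
from collections import Counter

def wl_refine(adj, colors):
    """Run 1-WL color refinement to a stable coloring.
    adj maps each vertex to a list of its neighbors."""
    while True:
        # Each vertex's signature: its color plus the multiset of neighbor colors.
        sigs = {v: (colors[v], tuple(sorted(colors[u] for u in adj[v])))
                for v in adj}
        palette = {s: i for i, s in enumerate(sorted(set(sigs.values())))}
        new = {v: palette[sigs[v]] for v in adj}
        if len(set(new.values())) == len(set(colors.values())):
            return new  # cell count unchanged: the partition is stable
        colors = new

def sample_ir_path(adj):
    """Sample one root-to-leaf path of the IR search tree: refine,
    then individualize a random vertex from a non-singleton cell.
    A particle filter would reweight and resample such paths."""
    colors = wl_refine(adj, {v: 0 for v in adj})
    trace = [colors.copy()]
    while True:
        cells = Counter(colors.values())
        non_singleton = [v for v in adj if cells[colors[v]] > 1]
        if not non_singleton:
            return trace  # discrete coloring: a leaf of the search tree
        v = random.choice(non_singleton)       # proposal step
        colors[v] = max(colors.values()) + 1   # individualize v
        colors = wl_refine(adj, colors)        # refine again
        trace.append(colors.copy())
```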
We show that the known list-decoding algorithms for univariate multiplicity and folded Reed-Solomon (FRS) codes can be made to run in $\tilde{O}(n)$ time. Univariate multiplicity codes and FRS codes are natural variants of Reed-Solomon codes that were discovered and studied for their applications to list decoding. It is known that for every $\epsilon>0$ and rate $r \in (0,1)$, there exist explicit families of these codes that have rate $r$ and can be list decoded from a $(1-r-\epsilon)$ fraction of errors with constant list size in polynomial time (Guruswami & Wang (IEEE Trans. Inform. Theory 2013) and Kopparty, Ron-Zewi, Saraf & Wootters (SIAM J. Comput. 2023)). In this work, we present randomized algorithms that perform the above list-decoding tasks in $\tilde{O}(n)$ time, where $n$ is the block length of the code. Our algorithms have two main components. The first component builds upon the lattice-based approach of Alekhnovich (IEEE Trans. Inform. Theory 2005), who designed an $\tilde{O}(n)$ time list-decoding algorithm for Reed-Solomon codes approaching the Johnson radius. As part of the second component, we design $\tilde{O}(n)$ time algorithms for two natural algebraic problems: given an $(m+2)$-variate polynomial $Q(x,y_0,\dots,y_m) = \tilde{Q}(x) + \sum_{i=0}^m Q_i(x)\cdot y_i$, the first algorithm solves order-$m$ linear differential equations of the form $Q\left(x, f(x), \frac{df}{dx}, \dots,\frac{d^m f}{dx^m}\right) \equiv 0$, while the second solves functional equations of the form $Q\left(x, f(x), f(\gamma x), \dots,f(\gamma^m x)\right) \equiv 0$, where $m$ is an arbitrary constant and $\gamma$ is a field element of sufficiently high order. These algorithms can be viewed as generalizations of the classical $\tilde{O}(n)$ time algorithms of Sieveking (Computing 1972) and Kung (Numer. Math. 1974) for computing the modular inverse of a power series, and might be of independent interest.
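For context, the Sieveking-Kung inversion cited at the end works by Newton iteration, doubling the precision at each step via $g \leftarrow g\,(2 - fg) \bmod x^{2k}$. A minimal sketch follows; naive multiplication stands in for the FFT-based multiplication that yields the quasi-linear running time, and exact field arithmetic is assumed.

```python
def poly_mul(a, b):
    """Naive O(|a||b|) convolution; swap in an FFT-based multiply
    to obtain the quasi-linear running time cited above."""
    out = [0] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] += x * y
    return out

def series_inverse(f, n):
    """Compute g with f*g = 1 (mod x^n) by Newton iteration.
    f is a coefficient list with f[0] invertible."""
    assert f[0] != 0
    g = [1 / f[0]]  # over a finite field, use the modular inverse
    k = 1
    while k < n:
        k = min(2 * k, n)
        fg = poly_mul(f[:k], g)[:k]
        t = [-c for c in fg]   # t = 2 - f*g, truncated to k terms
        t[0] += 2
        g = poly_mul(g, t)[:k]
    return g[:n]

# Example: 1/(1+x) = 1 - x + x^2 - x^3 + ... (mod x^4)
print(series_inverse([1, 1], 4))  # [1.0, -1.0, 1.0, -1.0]
```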
We present HiRA-Pro, a novel procedure to align, at high spatio-temporal resolutions, multimodal signals from real-world processes and systems that exhibit diverse transient, nonlinear stochastic dynamics, such as manufacturing machines. It is based on discerning and synchronizing the process signatures of salient kinematic and dynamic events across these disparate signals. HiRA-Pro addresses the challenge of aligning data with sub-millisecond phenomena, where traditional timestamp, external-trigger, or clock-based alignment methods fall short. The effectiveness of HiRA-Pro is demonstrated in a smart manufacturing context, where it aligns data from 13+ channels acquired during 3D-printing and milling operations on an Optomec-LENS MTS 500 hybrid machine. The aligned data is then voxelized to generate 0.25-second aligned data chunks that correspond to physical voxels on the produced part. The superiority of HiRA-Pro is further showcased through case studies in additive manufacturing, demonstrating improved machine learning-based predictive performance due to precise multimodal data alignment. Specifically, test classification accuracies improved by almost 35% with the application of HiRA-Pro, even with limited data, allowing for precise localization of artifacts. The paper also provides a comprehensive discussion of the proposed method, its applications, and a comparative qualitative analysis with a few other alignment methods. HiRA-Pro achieves spatio-temporal resolutions of 10-1000 µs and 100 µm in order to generate datasets that register with physical voxels on the 3D-printed and milled part. These resolutions are at least an order of magnitude finer than those of existing alignment methods that employ individual timestamps, statistical correlations, or common clocks, which achieve precision of hundreds of milliseconds.
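For a concrete point of comparison, the generic cross-correlation alignment that such methods typically improve upon can be sketched in a few lines (a standard baseline with NumPy, not the HiRA-Pro signature-matching procedure itself):

```python
import numpy as np

def event_lag(sig_a, sig_b):
    """Estimate the lag (in samples) between two channels by
    cross-correlating their standardized waveforms around a shared
    salient event; apply the lag to bring the channels into register."""
    a = (sig_a - sig_a.mean()) / (sig_a.std() + 1e-12)
    b = (sig_b - sig_b.mean()) / (sig_b.std() + 1e-12)
    xc = np.correlate(a, b, mode="full")
    return int(np.argmax(xc)) - (len(b) - 1)
```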
In this paper, we propose a new modified likelihood ratio test (LRT) for simultaneously testing the mean vectors and covariance matrices of two populations in high-dimensional settings. By employing tools from Random Matrix Theory (RMT), we derive the limiting null distribution of the modified LRT for generally distributed populations. Furthermore, we compare the proposed test with existing tests in simulations, demonstrating that the modified LRT exhibits favorable properties in terms of both size and power.
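For reference, the classical (unmodified) two-sample LRT statistic for simultaneously testing equality of means and covariances can be computed as below; the paper's modification and its RMT-based limiting null distribution are its contribution and are not reproduced here.

```python
import numpy as np

def classical_lrt_stat(X1, X2):
    """-2 log Lambda for the textbook LRT of H0: mu1 = mu2 and
    Sigma1 = Sigma2, with rows of X1, X2 as i.i.d. observations.
    This is the unmodified statistic, shown only as a starting point."""
    n1, p = X1.shape
    n2, _ = X2.shape
    n = n1 + n2
    m1, m2 = X1.mean(0), X2.mean(0)
    A1 = (X1 - m1).T @ (X1 - m1)   # within-sample scatter matrices
    A2 = (X2 - m2).T @ (X2 - m2)
    d = (m1 - m2).reshape(-1, 1)
    B = A1 + A2 + (n1 * n2 / n) * (d @ d.T)  # total scatter under H0
    logdet = lambda M: np.linalg.slogdet(M)[1]
    log_lam = (0.5 * n1 * logdet(A1) + 0.5 * n2 * logdet(A2)
               - 0.5 * n * logdet(B)
               + 0.5 * p * (n * np.log(n)
                            - n1 * np.log(n1) - n2 * np.log(n2)))
    return -2.0 * log_lam
```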
In this report, we present the latest model of the Gemini family, Gemini 1.5 Pro, a highly compute-efficient multimodal mixture-of-experts model capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. Gemini 1.5 Pro achieves near-perfect recall on long-context retrieval tasks across modalities, improves the state of the art in long-document QA, long-video QA, and long-context ASR, and matches or surpasses Gemini 1.0 Ultra's state-of-the-art performance across a broad set of benchmarks. Studying the limits of Gemini 1.5 Pro's long-context ability, we find continued improvement in next-token prediction and near-perfect retrieval (>99%) up to at least 10M tokens, a generational leap over existing models such as Claude 2.1 (200k) and GPT-4 Turbo (128k). Finally, we highlight surprising new capabilities of large language models at the frontier: when given a grammar manual for Kalamang, a language with fewer than 200 speakers worldwide, the model learns to translate English to Kalamang at a level similar to that of a person who learned from the same content.
The central path problem is a variation on the single-facility location problem. The aim is to find, in a given connected graph $G$, a path $P$ minimizing its eccentricity, which is the maximal distance from $P$ to any vertex of the graph $G$. The path eccentricity of $G$ is the minimal eccentricity achievable over all paths in $G$. In this article we consider the path eccentricity of the class of $k$-AT-free graphs. These are graphs in which any set of three vertices contains a pair such that every path between them uses at least one vertex of the closed neighborhood at distance $k$ of the third. We prove that such graphs have path eccentricity bounded by $k$. Moreover, we answer a question of G\'omez and Guti\'errez asking whether there is a relation between path eccentricity and the consecutive ones property. The latter is the property of a binary matrix admitting a permutation of its rows that places the 1's consecutively in each column. It was already known that graphs whose adjacency matrices have the consecutive ones property have path eccentricity at most 1, and that the same remains true when the augmented adjacency matrix (with ones on the diagonal) has the consecutive ones property. We generalize these results as follows. We study graphs whose adjacency matrices can be made to satisfy the consecutive ones property after changing some values on the diagonal, and show that those graphs have path eccentricity at most 2 by showing that they are 2-AT-free.
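To make the central definition concrete, the eccentricity of a fixed path can be computed with a single multi-source BFS; a minimal sketch:

```python
from collections import deque

def path_eccentricity(adj, path):
    """Max over vertices v of dist(v, P), via multi-source BFS
    started from every vertex of the path P.
    adj maps each vertex of a connected graph to its neighbors."""
    dist = {v: 0 for v in path}
    queue = deque(path)
    while queue:
        u = queue.popleft()
        for w in adj[u]:
            if w not in dist:
                dist[w] = dist[u] + 1
                queue.append(w)
    return max(dist.values())
```

The path eccentricity of $G$ is then the minimum of this quantity over all paths in $G$, which is the quantity the bounds above control.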
This paper concerns an expansion of first-order Belnap-Dunn logic whose connectives and quantifiers are all familiar from classical logic. The language and logical consequence relation of the logic are defined, a proof system for the defined logic is presented, and the soundness and completeness of the presented proof system are established. The close relationship between the logical consequence relations of the defined logic and the version of classical logic with the same language is illustrated by the minor differences between the presented proof system and a sound and complete proof system for that version of classical logic. Moreover, fifteen classical laws of logical equivalence are given by which the logical equivalence relation of the defined logic distinguishes itself from that of many logics that appear closely related at first glance.
Advanced techniques using Neural Radiance Fields (NeRF), Signed Distance Fields (SDF), and Occupancy Fields have recently emerged as solutions for 3D indoor scene reconstruction. We introduce a novel two-phase learning approach, H2O-SDF, that discriminates between object and non-object regions within indoor environments. This method achieves a nuanced balance, carefully preserving the geometric integrity of room layouts while also capturing the intricate surface details of specific objects. A cornerstone of our two-phase learning framework is the Object Surface Field (OSF), a novel concept designed to mitigate the persistent vanishing-gradient problem that has hindered the capture of high-frequency details in other methods. The proposed approach is validated through several experiments, including ablation studies.
I show that a one-dimensional (1D) conditional generative adversarial network (cGAN) is capable of unpaired signal-to-signal ("sig2sig") translation. Using a simplified CycleGAN model with 1D layers and wider convolutional kernels, mirroring how WaveGAN reframed two-dimensional (2D) image generation as 1D audio generation, I show that the 2D image-to-image translation task can be recast as a 1D signal-to-signal translation task with deep convolutional GANs, without substantial modification to the conventional U-Net model and adversarial architecture developed as CycleGAN. On a small tunable dataset, I show that noisy test signals unseen by the 1D CycleGAN model, trained without paired examples, are transformed from the source domain into signals closely resembling the paired test signals in the target domain, especially in terms of frequency content, and I quantify the remaining differences in terms of correlation and error.
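To make the 1D recasting concrete, here is a minimal sketch of matching encoder/decoder blocks in PyTorch; the kernel width, normalization, and channel handling are illustrative assumptions, not the exact configuration used in the paper.

```python
import torch
import torch.nn as nn

class Down(nn.Module):
    """1D U-Net encoder block: wide kernel, stride-2 downsampling,
    replacing CycleGAN's Conv2d with Conv1d."""
    def __init__(self, c_in, c_out, k=25):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv1d(c_in, c_out, k, stride=2, padding=k // 2),
            nn.InstanceNorm1d(c_out),
            nn.LeakyReLU(0.2),
        )

    def forward(self, x):
        return self.net(x)

class Up(nn.Module):
    """1D U-Net decoder block with a skip connection, mirroring Down."""
    def __init__(self, c_in, c_out, k=25):
        super().__init__()
        self.net = nn.Sequential(
            nn.ConvTranspose1d(c_in, c_out, k, stride=2,
                               padding=k // 2, output_padding=1),
            nn.InstanceNorm1d(c_out),
            nn.ReLU(),
        )

    def forward(self, x, skip):
        return torch.cat([self.net(x), skip], dim=1)
```

Swapping such blocks in for their 2D counterparts leaves CycleGAN's cycle-consistency and adversarial losses untouched, which is the sense in which no substantial architectural modification is needed.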
Recently, pre-trained language representation models such as BERT have shown great success when fine-tuned on downstream tasks, including information retrieval (IR). However, pre-training objectives tailored for ad-hoc retrieval have not been well explored. In this paper, we propose Pre-training with Representative wOrds Prediction (PROP) for ad-hoc retrieval. PROP is inspired by the classical statistical language model for IR, specifically the query likelihood model, which assumes that the query is generated as the piece of text representative of the "ideal" document. Based on this idea, we construct the representative words prediction (ROP) task for pre-training. Given an input document, we sample a pair of word sets according to the document language model, where the set with the higher likelihood is deemed more representative of the document. We then pre-train the Transformer model to predict the pairwise preference between the two word sets, jointly with the Masked Language Model (MLM) objective. By further fine-tuning on a variety of representative downstream ad-hoc retrieval tasks, PROP achieves significant improvements over baselines without pre-training or with other pre-training methods. We also show that PROP achieves strong performance under both zero-resource and low-resource IR settings. The code and pre-trained models are available at https://github.com/Albert-Ma/PROP.
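A minimal sketch of the sampling step behind ROP; the Dirichlet-smoothed unigram document LM and the word-set size are standard assumptions for illustration, not the paper's exact recipe.

```python
import math
import random
from collections import Counter

def doc_lm(doc_tokens, corpus_prob, mu=2000):
    """Dirichlet-smoothed unigram document language model."""
    tf, total = Counter(doc_tokens), len(doc_tokens)
    vocab = set(doc_tokens) | set(corpus_prob)
    return {w: (tf[w] + mu * corpus_prob.get(w, 1e-9)) / (total + mu)
            for w in vocab}

def sample_rop_pair(doc_tokens, corpus_prob, size=5):
    """Sample two word sets from the document LM; the higher-likelihood
    set is the positive ('more representative') example for the
    pairwise-preference pre-training objective."""
    lm = doc_lm(doc_tokens, corpus_prob)
    words, probs = zip(*lm.items())

    def draw():
        s = random.choices(words, weights=probs, k=size)
        return s, sum(math.log(lm[w]) for w in s)

    (s1, l1), (s2, l2) = draw(), draw()
    return (s1, s2) if l1 >= l2 else (s2, s1)  # (positive, negative)
```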
Recent work pre-training Transformers with self-supervised objectives on large text corpora has shown great success when fine-tuned on downstream NLP tasks, including text summarization. However, pre-training objectives tailored for abstractive text summarization have not been explored. Furthermore, there is a lack of systematic evaluation across diverse domains. In this work, we propose pre-training large Transformer-based encoder-decoder models on massive text corpora with a new self-supervised objective. In PEGASUS, important sentences are removed/masked from an input document and are generated together as one output sequence from the remaining sentences, similar to an extractive summary. We evaluated our best PEGASUS model on 12 downstream summarization tasks spanning news, science, stories, instructions, emails, patents, and legislative bills. Experiments demonstrate that it achieves state-of-the-art performance on all 12 downstream datasets, as measured by ROUGE scores. Our model also shows surprising performance on low-resource summarization, surpassing previous state-of-the-art results on 6 datasets with only 1000 examples. Finally, we validated our results using human evaluation and show that our model's summaries achieve human performance on multiple datasets.
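A toy sketch of the gap-sentence objective described above, with word overlap as a crude stand-in for the ROUGE-based sentence scoring used in PEGASUS:

```python
def gap_sentence_mask(sentences, ratio=0.3, mask_token="<mask_1>"):
    """Score each sentence by word overlap with the rest of the
    document, mask the top-scoring ones in the input, and use their
    concatenation as the generation target."""
    def score(i):
        s = set(sentences[i].split())
        rest = {w for j, sent in enumerate(sentences) if j != i
                for w in sent.split()}
        return len(s & rest) / max(len(s), 1)

    k = max(1, int(ratio * len(sentences)))
    top = set(sorted(range(len(sentences)), key=score, reverse=True)[:k])
    source = " ".join(mask_token if i in top else s
                      for i, s in enumerate(sentences))
    target = " ".join(sentences[i] for i in sorted(top))
    return source, target
```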