
The single shortest path algorithm is undefined for weighted finite-state automata over non-idempotent semirings because such semirings do not guarantee the existence of a shortest path. However, in non-idempotent semirings admitting an order satisfying a monotonicity condition (such as the plus-times or log semirings), the notion of shortest string is well-defined. We describe an algorithm which finds the shortest string for a weighted non-deterministic automaton over such semirings using the backwards shortest distance of an equivalent deterministic automaton (DFA) as a heuristic for A* search performed over a companion idempotent semiring, which is proven to return the shortest string. While there may be exponentially more states in the DFA, this algorithm needs to visit only a small fraction of them if determinization is performed "on the fly".
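To make the distinction between the shortest path and the shortest string concrete, here is a minimal brute-force sketch over a tiny acyclic automaton in the plus-times (probability) semiring. The automaton, the names `ARCS` and `FINAL`, and the reading of "shortest" as "highest total probability" are all illustrative assumptions. This is plain enumeration, not the A* algorithm described in the abstract.

```python
from collections import defaultdict

# Hypothetical acyclic weighted automaton over the plus-times semiring:
# ARCS[state] = list of (label, weight, next_state). State 0 is the start.
ARCS = {
    0: [("a", 0.3, 1), ("a", 0.3, 2), ("b", 0.4, 3)],
    1: [],
    2: [],
    3: [],
}
FINAL = {1, 2, 3}


def enumerate_paths(state=0, string="", weight=1.0):
    """Yield (string, weight) for every accepting path starting at `state`."""
    if state in FINAL:
        yield string, weight
    for label, w, nxt in ARCS[state]:
        # Path weights multiply along a path (the semiring "times").
        yield from enumerate_paths(nxt, string + label, weight * w)


paths = list(enumerate_paths())

# Best single path: compares paths individually.
best_path = max(paths, key=lambda p: p[1])

# Best string: weights of all paths with the same string are added
# (the semiring "plus") before comparing.
string_weight = defaultdict(float)
for s, w in paths:
    string_weight[s] += w
best_string = max(string_weight.items(), key=lambda kv: kv[1])
```

Here the best single path is labeled "b" (weight 0.4), while the best string is "a", because its two paths of weight 0.3 each sum to 0.6. Because the automaton is non-deterministic, the string "a" is spread over several paths; in a deterministic automaton each string has a single path, so the two notions coincide.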

Related content


Learning · Variance · Online · Analysis · Optimizer

June 6, 2022

A Regret-Variance Trade-Off in Online Learning

Dirk van der Hoeven, Nikita Zhivotovskiy, Nicolò Cesa-Bianchi

We consider prediction with expert advice for strongly convex and bounded losses, and investigate trade-offs between regret and "variance" (i.e., the squared difference between the learner's predictions and the best expert's predictions). With $K$ experts, the Exponentially Weighted Average (EWA) algorithm is known to achieve $O(\log K)$ regret. We prove that a variant of EWA either achieves a negative regret (i.e., the algorithm outperforms the best expert), or guarantees an $O(\log K)$ bound on both variance and regret. Building on this result, we show several examples of how the variance of predictions can be exploited in learning. In the online-to-batch analysis, we show that a large empirical variance allows one to stop the online-to-batch conversion early and outperform the risk of the best predictor in the class. We also recover the optimal rate of model selection aggregation when we do not consider early stopping. In online prediction with corrupted losses, we show that the effect of corruption on the regret can be compensated by a large variance. In online selective sampling, we design an algorithm that samples less when the variance is large, while guaranteeing the optimal regret bound in expectation. In online learning with abstention, we use a term similar to the variance to derive the first high-probability $O(\log K)$ regret bound in this setting. Finally, we extend our results to the setting of online linear regression.
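For readers unfamiliar with EWA, the following is a minimal sketch of the standard algorithm with squared loss, not the variant analyzed in the paper. The function names, the choice of squared loss, and the learning rate `eta=0.5` are illustrative assumptions; predictions and outcomes are assumed to lie in $[0, 1]$.

```python
import math


def ewa_predict(weights, expert_preds):
    """Weighted average of the experts' predictions."""
    total = sum(weights)
    return sum(w * p for w, p in zip(weights, expert_preds)) / total


def ewa_update(weights, expert_preds, outcome, eta):
    """Multiply each weight by exp(-eta * squared loss of that expert)."""
    return [w * math.exp(-eta * (p - outcome) ** 2)
            for w, p in zip(weights, expert_preds)]


def run_ewa(rounds, eta=0.5):
    """rounds: list of (expert_preds, outcome) pairs.

    Returns (cumulative learner loss, list of cumulative expert losses).
    """
    k = len(rounds[0][0])
    weights = [1.0] * k
    learner_loss = 0.0
    expert_losses = [0.0] * k
    for preds, y in rounds:
        y_hat = ewa_predict(weights, preds)
        learner_loss += (y_hat - y) ** 2
        for i, p in enumerate(preds):
            expert_losses[i] += (p - y) ** 2
        weights = ewa_update(weights, preds, y, eta)
    return learner_loss, expert_losses


# Two hypothetical experts: one always predicts 0, the other always 1.
rounds = [([0.0, 1.0], 1.0)] * 20
learner_loss, expert_losses = run_ewa(rounds)
regret = learner_loss - min(expert_losses)
```

In this toy run the regret stays below $\ln(K)/\eta$ with $K = 2$, consistent with the $O(\log K)$ guarantee mentioned above for exp-concave losses.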

Reproducing kernel Hilbert space · Kernelization · Empirical risk minimization · Empirical risk · Approximation

June 6, 2022

RFN: A Random-Feature Based Newton Method for Empirical Risk Minimization in Reproducing Kernel Hilbert Spaces

Ting-Jui Chang, Shahin Shahrampour

In supervised learning using kernel methods, we often encounter a large-scale finite-sum minimization over a reproducing kernel Hilbert space (RKHS). Large-scale finite-sum problems can be solved using efficient variants of Newton's method, where the Hessian is approximated via sub-samples of data. In an RKHS, however, the dependence of the penalty function on the kernel makes standard sub-sampling approaches inapplicable, since the Gram matrix is not readily available in a low-rank form. In this paper, we observe that for this class of problems, one can naturally use kernel approximation to speed up Newton's method. Focusing on randomized features for kernel approximation, we provide a novel second-order algorithm that enjoys local superlinear convergence and global linear convergence (with high probability). We derive the theoretical lower bound for the number of random features required for the approximated Hessian to be close to the true Hessian in the norm sense. Our numerical experiments on real-world data verify the efficiency of our method compared to several benchmarks.
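The kernel approximation step can be illustrated with random Fourier features for a Gaussian kernel. This is a minimal standard-library sketch: the Gaussian kernel, the names `make_rff` and `rbf_kernel`, and all parameter values are assumptions chosen for illustration, and the Newton step itself is not shown.

```python
import math
import random


def make_rff(dim, num_features, gamma, seed=0):
    """Sample a random Fourier feature map for k(x, y) = exp(-gamma * ||x - y||^2)."""
    rng = random.Random(seed)
    sigma = math.sqrt(2.0 * gamma)  # frequencies are drawn from N(0, 2 * gamma * I)
    freqs = [[rng.gauss(0.0, sigma) for _ in range(dim)]
             for _ in range(num_features)]
    phases = [rng.uniform(0.0, 2.0 * math.pi) for _ in range(num_features)]
    scale = math.sqrt(2.0 / num_features)

    def feature_map(x):
        return [scale * math.cos(sum(w_j * x_j for w_j, x_j in zip(w, x)) + b)
                for w, b in zip(freqs, phases)]

    return feature_map


def rbf_kernel(x, y, gamma):
    """Exact Gaussian kernel value, for comparison."""
    return math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(x, y)))


x, y = [0.2, -0.1], [0.5, 0.3]
phi = make_rff(dim=2, num_features=2000, gamma=1.0)
# The inner product of the feature vectors approximates the kernel value.
approx = sum(a * b for a, b in zip(phi(x), phi(y)))
exact = rbf_kernel(x, y, gamma=1.0)
```

With explicit finite-dimensional features, the Gram matrix is approximated by a product of feature matrices, which gives the low-rank form that sub-sampled Newton methods rely on.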

Estimation/Estimator · Maximum likelihood estimation · Maximum likelihood · INFORMS · Likelihood

June 6, 2022

Finite-Sample Maximum Likelihood Estimation of Location

Shivam Gupta, Jasper C. H. Lee, Eric Price, Paul Valiant

from arxiv, In submission

We consider 1-dimensional location estimation, where we estimate a parameter $\lambda$ from $n$ samples $\lambda + \eta_i$, with each $\eta_i$ drawn i.i.d. from a known distribution $f$. For fixed $f$ the maximum-likelihood estimate (MLE) is well-known to be optimal in the limit as $n \to \infty$: it is asymptotically normal with variance matching the Cramér-Rao lower bound of $\frac{1}{n\mathcal{I}}$, where $\mathcal{I}$ is the Fisher information of $f$. However, this bound does not hold for finite $n$, or when $f$ varies with $n$. We show for arbitrary $f$ and $n$ that one can recover a similar theory based on the Fisher information of a smoothed version of $f$, where the smoothing radius decays with $n$.
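As a worked instance of the quantities above, consider the textbook case where $f$ is Gaussian with variance $\sigma^2$ (an illustrative assumption, not an example taken from the paper). Here the MLE is the sample mean, and the variance $\frac{1}{n\mathcal{I}}$ is attained exactly for every $n$:

```latex
\mathcal{I} = \int \frac{f'(x)^2}{f(x)}\,dx
            = \mathbb{E}\!\left[\frac{\eta^2}{\sigma^4}\right]
            = \frac{1}{\sigma^2},
\qquad
\hat{\lambda}_{\mathrm{MLE}} = \frac{1}{n}\sum_{i=1}^{n} (\lambda + \eta_i),
\qquad
\operatorname{Var}\bigl(\hat{\lambda}_{\mathrm{MLE}}\bigr)
  = \frac{\sigma^2}{n} = \frac{1}{n\mathcal{I}}.
```

The Gaussian case is special in this respect; for general $f$ the match is only asymptotic, which is the gap the abstract addresses.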

Shaping · Pair · Extensibility · Continuity · Optimization

June 6, 2022

Co-generation of Collision-Free Shapes for Arbitrary One-Parametric Motion

Clinton B. Morris, Morad Behandish

from arxiv, Special Issue on symposium on Solid and Physical Modeling (SPM'2022)

Mechanical assemblies can exhibit complex relative motions, during which collisions between moving parts and their surroundings must be avoided. To define feasible design spaces for each part's shape, "maximal" collision-free pointsets can be computed using configuration space modeling techniques such as Minkowski operations and sweep/unsweep. For example, for a pair of parts undergoing a given relative motion, to make the problem well-posed, the geometry of one part (chosen arbitrarily) must be fixed to compute the maximal shape of the other part by an unsweep operation. Making such arbitrary choices in a multi-component assembly can place unnecessary restrictions on the design space. A broader family of collision-free pairs of parts can be explored, if fixing the geometry of a component is not required. In this paper, we formalize this family of collision-free shapes and introduce a generic method for generating a broad subset of them. Our procedure, which is an extension of the unsweep, allows for co-generation of a pair of geometries which are modified incrementally and simultaneously to avoid collision. We demonstrate the effectiveness and scalability of our procedure in both 2D and 3D by generating a variety of collision-free shapes. Notably, we show that our approach can automatically generate freeform cam and follower profiles, gear teeth, and screw threads, starting from colliding blocks of materials, solely from a specification of relative motion and without the use of any feature-informed heuristics. Moreover, our approach provides continuous measures of collision that can be incorporated into standard gradient-descent design optimization, allowing for simultaneous collision-free and physics-informed co-design of mechanical parts for assembly.

Round · Automator · Backward · Software engineering · Programming language

June 5, 2022

Geometric Theory for Program Testing

Bernhard Moller, Tony Hoare, Zhe Hou, Jin Song Dong

Formal methods for verification of programs are extended to testing of programs. Their combination is intended to lead to benefits in reliable program development, testing, and evolution. Our geometric theory of testing is intended to serve as the specification of a testing environment, included as the last stage of a toolchain that assists professional programmers, amateurs, and students of Computer Science. The testing environment includes an automated algorithm which locates errors in a test that has been run, and assists in correcting them. It does this by displaying, on a monitor screen, a stick diagram of causal chains in the execution of the program under test. The diagram can then be navigated backwards in the familiar style of a satnav following roads on a map. This will reveal selections of places at which the program should be modified to remove the error.

Robustness · Estimation/Estimator · Kernelization · Input space · UniFormer

June 3, 2022

Robust Persistence Diagrams using Reproducing Kernels

Siddharth Vishwanath, Kenji Fukumizu, Satoshi Kuriki, Bharath Sriperumbudur

Persistent homology has become an important tool for extracting geometric and topological features from data, whose multi-scale features are summarized in a persistence diagram. From a statistical perspective, however, persistence diagrams are very sensitive to perturbations in the input space. In this work, we develop a framework for constructing robust persistence diagrams from superlevel filtrations of robust density estimators constructed using reproducing kernels. Using an analogue of the influence function on the space of persistence diagrams, we establish the proposed framework to be less sensitive to outliers. The robust persistence diagrams are shown to be consistent estimators in bottleneck distance, with the convergence rate controlled by the smoothness of the kernel. This, in turn, allows us to construct uniform confidence bands in the space of persistence diagrams. Finally, we demonstrate the superiority of the proposed approach on benchmark datasets.

Learning · Kernelization · Kernel ridge regression · Early stopping · Ridge regression

June 3, 2022

On the Benefits of Large Learning Rates for Kernel Methods

Gaspard Beugnot, Julien Mairal, Alessandro Rudi

from arxiv, Accepted paper at Conference COLT 2022. To be published to Proceedings of Machine Learning Research (PMLR)

This paper studies an intriguing phenomenon related to the good generalization performance of estimators obtained by using large learning rates within gradient descent algorithms. Although it was first observed in the deep learning literature, we show that this phenomenon can be precisely characterized in the context of kernel methods, even though the resulting optimization problem is convex. Specifically, we consider the minimization of a quadratic objective in a separable Hilbert space, and show that with early stopping, the choice of learning rate influences the spectral decomposition of the obtained solution on the Hessian's eigenvectors. This extends an intuition described by Nakkiran (2020) on a two-dimensional toy problem to realistic learning scenarios such as kernel ridge regression. While large learning rates may be proven beneficial as soon as there is a mismatch between the train and test objectives, we further explain why this benefit already occurs in classification tasks without assuming any particular mismatch between train and test data distributions.
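The interaction between learning rate, early stopping, and the Hessian's eigenvectors can be seen on a toy diagonal quadratic. The sketch below is an illustration under simplifying assumptions (finite dimension, diagonal Hessian, initialization at zero) and is not the paper's analysis; the function names and the numbers are hypothetical.

```python
def gd_quadratic(eigenvalues, target, lr, steps):
    """Gradient descent on 0.5 * sum_i lam_i * (w_i - target_i)^2, starting at w = 0."""
    w = [0.0] * len(eigenvalues)
    for _ in range(steps):
        w = [w_i - lr * lam * (w_i - t_i)
             for w_i, lam, t_i in zip(w, eigenvalues, target)]
    return w


def spectral_filter(lam, lr, steps):
    """Closed-form fraction of the target recovered along an eigenvalue-lam direction."""
    return 1.0 - (1.0 - lr * lam) ** steps


eigenvalues = [1.0, 0.1]   # one well-conditioned and one flat direction
target = [1.0, 1.0]
steps = 5                  # early stopping: far from convergence

small_lr = gd_quadratic(eigenvalues, target, lr=0.5, steps=steps)
large_lr = gd_quadratic(eigenvalues, target, lr=1.5, steps=steps)
```

After the same number of steps, the larger learning rate recovers noticeably more of the target along the small-eigenvalue direction, so the two early-stopped solutions differ in how they are spread over the Hessian's eigenvectors.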

Likelihood · Sample space · Sample · Functional · Analysis

June 3, 2022

Some parametric tests based on sample spacings

Rahul Singh, Neeraj Misra

from arxiv, 26 pages, 5 figures

Assume that we have a random sample from an absolutely continuous distribution (univariate or multivariate) with a known functional form and some unknown parameters. In this paper, we study several parametric tests based on statistics that are symmetric functions of $m$-step disjoint sample spacings. Asymptotic properties of these tests are investigated under the simple null hypothesis and under a sequence of local alternatives converging to the null hypothesis. The asymptotic properties of the proposed tests are also studied under the composite null hypothesis. We observe that these tests have asymptotic properties similar to those of the likelihood ratio test. Finite-sample performances of the proposed tests are assessed numerically. A data analysis based on real data is also reported. The proposed tests provide an alternative to similar tests based on simple spacings (i.e., $m=1$) that were proposed earlier in the literature. These tests also provide an alternative to likelihood ratio tests in situations where the likelihood function may be unbounded and hence likelihood ratio tests do not exist.
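As a rough illustration of the objects involved, the sketch below computes $m$-step disjoint spacings for values already transformed to $[0, 1]$ (for example by the hypothesized distribution function) and evaluates one symmetric function of them. The boundary convention (adding 0 and 1 as endpoints), the divisibility requirement, and the Greenwood-type sum of squares are assumptions for this example; the paper's exact definitions may differ.

```python
def disjoint_spacings(u, m):
    """m-step disjoint spacings of values in [0, 1], with 0 and 1 added as endpoints."""
    points = [0.0] + sorted(u) + [1.0]
    if (len(points) - 1) % m != 0:
        raise ValueError("n + 1 must be divisible by m in this sketch")
    # Gaps between every m-th order statistic; consecutive gaps do not overlap.
    return [points[j + m] - points[j] for j in range(0, len(points) - 1, m)]


def greenwood_statistic(spacings):
    """Sum of squared spacings, one example of a symmetric function of the spacings."""
    return sum(d * d for d in spacings)


u = [0.1, 0.4, 0.5, 0.7, 0.9]           # hypothetical transformed sample, n = 5
simple = disjoint_spacings(u, m=1)       # 6 simple spacings
two_step = disjoint_spacings(u, m=2)     # 3 two-step disjoint spacings
stat = greenwood_statistic(two_step)
```

Under this convention the spacings always sum to 1, and the statistic does not depend on the order in which the spacings are listed.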

Linear · Class · Matrix product · Optimizer · GROUP

June 3, 2022

Skew constacyclic codes over a class of finite commutative semisimple rings

Ying Zhao

In this article, we study skew constacyclic codes over a class of finite commutative semisimple rings. The automorphism group of $\mathcal{R}=\prod_{i=1}^t F_q$ is determined, and we characterize skew constacyclic codes over this ring in terms of linear codes over the finite field $F_q$. We also define homomorphisms which map linear codes over $\mathcal{R}$ to matrix product codes over $F_q$, and obtain some optimal linear codes over finite fields.

Language modeling · Code · Learning · TOOLS · Few-shot learning

June 2, 2022

Code Generation Tools (Almost) for Free? A Study of Few-Shot, Pre-Trained Language Models on Code

Patrick Bareiß, Beatriz Souza, Marcelo d'Amorim, Michael Pradel

from arxiv, 12 pages, 5 figures, submitted to ASE 2022

Few-shot learning with large-scale, pre-trained language models is a powerful way to answer questions about code, e.g., how to complete a given code example, or even generate code snippets from scratch. The success of these models raises the question of whether they could serve as a basis for building a wide range of code generation tools. Traditionally, such tools are built manually and separately for each task. Instead, few-shot learning may make it possible to obtain different tools from a single pre-trained language model by simply providing a few examples or a natural language description of the expected tool behavior. This paper studies to what extent a state-of-the-art, pre-trained language model of code, Codex, may serve this purpose. We consider three code manipulation and code generation tasks targeted by a range of traditional tools: (i) code mutation; (ii) test oracle generation from natural language documentation; and (iii) test case generation. For each task, we compare few-shot learning to a manually built tool. Our results show that the model-based tools complement (code mutation), are on par with (test oracle generation), or even outperform (test case generation) their respective traditionally built tools, while requiring far less effort to develop. By comparing the effectiveness of different variants of the model-based tools, we provide insights on how to design an appropriate input ("prompt") to the model and what influence the size of the model has. For example, we find that providing a small natural language description of the code generation task is an easy way to improve predictions. Overall, we conclude that few-shot language models are surprisingly effective, yet there is still more work to be done, such as exploring more diverse ways of prompting and tackling even more involved tasks.