亚洲男人的天堂2018av,欧美草比,久久久久久免费视频精选,国色天香在线看免费,久久久久亚洲av成人片仓井空

This paper formulates a general cross validation framework for signal denoising. The general framework is then applied to nonparametric regression methods such as Trend Filtering and Dyadic CART. The resulting cross validated versions are then shown to attain nearly the same rates of convergence as are known for the optimally tuned analogues. There did not exist any previous theoretical analyses of cross validated versions of Trend Filtering or Dyadic CART. To illustrate the generality of the framework we also propose and study cross validated versions of two fundamental estimators; lasso for high dimensional linear regression and singular value thresholding for matrix estimation. Our general framework is inspired by the ideas in Chatterjee and Jafarov (2015) and is potentially applicable to a wide range of estimation methods which use tuning parameters.

相關內容

交(jiao)(jiao)叉驗(yan)(yan)(yan)證(zheng),有時也稱為旋轉估(gu)計(ji)(ji)或(huo)樣(yang)(yang)本外測(ce)(ce)(ce)試,是用(yong)(yong)于(yu)評估(gu)統(tong)計(ji)(ji)結果如(ru)何(he)(he)的(de)(de)(de)各種類(lei)似模(mo)型(xing)(xing)(xing)(xing)驗(yan)(yan)(yan)證(zheng)技術(shu)中的(de)(de)(de)任何(he)(he)一(yi)(yi)(yi)種分析將(jiang)概括為一(yi)(yi)(yi)個(ge)(ge)獨立的(de)(de)(de)數(shu)(shu)(shu)(shu)(shu)據(ju)(ju)集(ji)(ji)(ji)。它主要用(yong)(yong)于(yu)設置,其目的(de)(de)(de)是預(yu)測(ce)(ce)(ce),和(he)一(yi)(yi)(yi)個(ge)(ge)想要估(gu)計(ji)(ji)如(ru)何(he)(he)準確地一(yi)(yi)(yi)個(ge)(ge)預(yu)測(ce)(ce)(ce)模(mo)型(xing)(xing)(xing)(xing)在(zai)實(shi)踐中執(zhi)行(xing)。在(zai)預(yu)測(ce)(ce)(ce)問題(ti)中,通常會給模(mo)型(xing)(xing)(xing)(xing)一(yi)(yi)(yi)個(ge)(ge)已知數(shu)(shu)(shu)(shu)(shu)據(ju)(ju)的(de)(de)(de)數(shu)(shu)(shu)(shu)(shu)據(ju)(ju)集(ji)(ji)(ji),在(zai)該(gai)數(shu)(shu)(shu)(shu)(shu)據(ju)(ju)集(ji)(ji)(ji)上進(jin)行(xing)訓練(lian)(訓練(lian)數(shu)(shu)(shu)(shu)(shu)據(ju)(ju)集(ji)(ji)(ji))以(yi)及(ji)(ji)未知數(shu)(shu)(shu)(shu)(shu)據(ju)(ju)(或(huo)首次看到的(de)(de)(de)數(shu)(shu)(shu)(shu)(shu)據(ju)(ju))的(de)(de)(de)數(shu)(shu)(shu)(shu)(shu)據(ju)(ju)集(ji)(ji)(ji)(根據(ju)(ju)該(gai)數(shu)(shu)(shu)(shu)(shu)據(ju)(ju)集(ji)(ji)(ji)測(ce)(ce)(ce)試模(mo)型(xing)(xing)(xing)(xing))(稱為驗(yan)(yan)(yan)證(zheng)數(shu)(shu)(shu)(shu)(shu)據(ju)(ju)集(ji)(ji)(ji)或(huo)測(ce)(ce)(ce)試集(ji)(ji)(ji))。交(jiao)(jiao)叉驗(yan)(yan)(yan)證(zheng)的(de)(de)(de)目標(biao)是測(ce)(ce)(ce)試模(mo)型(xing)(xing)(xing)(xing)預(yu)測(ce)(ce)(ce)未用(yong)(yong)于(yu)估(gu)計(ji)(ji)數(shu)(shu)(shu)(shu)(shu)據(ju)(ju)的(de)(de)(de)新數(shu)(shu)(shu)(shu)(shu)據(ju)(ju)的(de)(de)(de)能力,以(yi)發現諸如(ru)過度擬合或(huo)選擇偏倚(yi)之(zhi)類(lei)的(de)(de)(de)問題(ti),并(bing)提供有關如(ru)何(he)(he)進(jin)行(xing)建模(mo)的(de)(de)(de)見解。該(gai)模(mo)型(xing)(xing)(xing)(xing)將(jiang)推廣到一(yi)(yi)(yi)個(ge)(ge)獨立的(de)(de)(de)數(shu)(shu)(shu)(shu)(shu)據(ju)(ju)集(ji)(ji)(ji)(例(li)如(ru),未知數(shu)(shu)(shu)(shu)(shu)據(ju)(ju)集(ji)(ji)(ji),例(li)如(ru)來自實(shi)際問題(ti)的(de)(de)(de)數(shu)(shu)(shu)(shu)(shu)據(ju)(ju)集(ji)(ji)(ji))。 一(yi)(yi)(yi)輪交(jiao)(jiao)叉驗(yan)(yan)(yan)證(zheng)涉及(ji)(ji)分割一(yi)(yi)(yi)個(ge)(ge)樣(yang)(yang)品的(de)(de)(de)數(shu)(shu)(shu)(shu)(shu)據(ju)(ju)到互補的(de)(de)(de)子(zi)(zi)集(ji)(ji)(ji),在(zai)一(yi)(yi)(yi)個(ge)(ge)子(zi)(zi)集(ji)(ji)(ji)執(zhi)行(xing)所(suo)述分析(稱為訓練(lian)集(ji)(ji)(ji)),以(yi)及(ji)(ji)驗(yan)(yan)(yan)證(zheng)在(zai)另一(yi)(yi)(yi)子(zi)(zi)集(ji)(ji)(ji)中的(de)(de)(de)分析(稱為驗(yan)(yan)(yan)證(zheng)集(ji)(ji)(ji)合或(huo)測(ce)(ce)(ce)試集(ji)(ji)(ji))。為了(le)減少可變性,在(zai)大多數(shu)(shu)(shu)(shu)(shu)方法中,使用(yong)(yong)不同的(de)(de)(de)分區(qu)執(zhi)行(xing)多輪交(jiao)(jiao)叉驗(yan)(yan)(yan)證(zheng),并(bing)將(jiang)驗(yan)(yan)(yan)證(zheng)結果組(zu)合(例(li)如(ru)取平均值)在(zai)各輪中,以(yi)估(gu)計(ji)(ji)模(mo)型(xing)(xing)(xing)(xing)的(de)(de)(de)預(yu)測(ce)(ce)(ce)性能。 總而(er)言(yan)之(zhi),交(jiao)(jiao)叉驗(yan)(yan)(yan)證(zheng)結合了(le)預(yu)測(ce)(ce)(ce)中適(shi)用(yong)(yong)性的(de)(de)(de)度量(平均),以(yi)得出模(mo)型(xing)(xing)(xing)(xing)預(yu)測(ce)(ce)(ce)性能的(de)(de)(de)更準確估(gu)計(ji)(ji)。

We study a new two-time-scale stochastic gradient method for solving optimization problems, where the gradients are computed with the aid of an auxiliary variable under samples generated by time-varying Markov random processes parameterized by the underlying optimization variable. These time-varying samples make gradient directions in our update biased and dependent, which can potentially lead to the divergence of the iterates. In our two-time-scale approach, one scale is to estimate the true gradient from these samples, which is then used to update the estimate of the optimal solution. While these two iterates are implemented simultaneously, the former is updated "faster" (using bigger step sizes) than the latter (using smaller step sizes). Our first contribution is to characterize the finite-time complexity of the proposed two-time-scale stochastic gradient method. In particular, we provide explicit formulas for the convergence rates of this method under different structural assumptions, namely, strong convexity, convexity, the Polyak-Lojasiewicz condition, and general non-convexity. We apply our framework to two problems in control and reinforcement learning. First, we look at the standard online actor-critic algorithm over finite state and action spaces and derive a convergence rate of O(k^(-2/5)), which recovers the best known rate derived specifically for this problem. Second, we study an online actor-critic algorithm for the linear-quadratic regulator and show that a convergence rate of O(k^(-2/3)) is achieved. This is the first time such a result is known in the literature. Finally, we support our theoretical analysis with numerical simulations where the convergence rates are visualized.

The variational autoencoder (VAE) is a popular deep latent variable model used to analyse high-dimensional datasets by learning a low-dimensional latent representation of the data. It simultaneously learns a generative model and an inference network to perform approximate posterior inference. Recently proposed extensions to VAEs that can handle temporal and longitudinal data have applications in healthcare, behavioural modelling, and predictive maintenance. However, these extensions do not account for heterogeneous data (i.e., data comprising of continuous and discrete attributes), which is common in many real-life applications. In this work, we propose the heterogeneous longitudinal VAE (HL-VAE) that extends the existing temporal and longitudinal VAEs to heterogeneous data. HL-VAE provides efficient inference for high-dimensional datasets and includes likelihood models for continuous, count, categorical, and ordinal data while accounting for missing observations. We demonstrate our model's efficacy through simulated as well as clinical datasets, and show that our proposed model achieves competitive performance in missing value imputation and predictive accuracy.

Removing noise from the any processed images is very important. Noise should be removed in such a way that important information of image should be preserved. A decisionbased nonlinear algorithm for elimination of band lines, drop lines, mark, band lost and impulses in images is presented in this paper. The algorithm performs two simultaneous operations, namely, detection of corrupted pixels and evaluation of new pixels for replacing the corrupted pixels. Removal of these artifacts is achieved without damaging edges and details. However, the restricted window size renders median operation less effective whenever noise is excessive in that case the proposed algorithm automatically switches to mean filtering. The performance of the algorithm is analyzed in terms of Mean Square Error [MSE], Peak-Signal-to-Noise Ratio [PSNR], Signal-to-Noise Ratio Improved [SNRI], Percentage Of Noise Attenuated [PONA], and Percentage Of Spoiled Pixels [POSP]. This is compared with standard algorithms already in use and improved performance of the proposed algorithm is presented. The advantage of the proposed algorithm is that a single algorithm can replace several independent algorithms which are required for removal of different artifacts.

The problems of frictional contacts are the key to the investigation of mechanical performances of composite materials under varying service environments. The paper considers a linear elasticity system with strongly heterogeneous coefficients and quasistatic Tresca's friction law, and we study the homogenization theories under the frameworks of H-convergence and small $\epsilon$-periodicity. The qualitative result is based on H-convergence, which shows the original oscillating solutions will converge weakly to the homogenized solution, while our quantitative result provides an estimate of asymptotic errors in the $H^1$ norm for the periodic homogenization. We also design several numerical experiments to validate the convergence rates in the quantitative analysis.

We propose a novel framework for learning a low-dimensional representation of data based on nonlinear dynamical systems, which we call dynamical dimension reduction (DDR). In the DDR model, each point is evolved via a nonlinear flow towards a lower-dimensional subspace; the projection onto the subspace gives the low-dimensional embedding. Training the model involves identifying the nonlinear flow and the subspace. Following the equation discovery method, we represent the vector field that defines the flow using a linear combination of dictionary elements, where each element is a pre-specified linear/nonlinear candidate function. A regularization term for the average total kinetic energy is also introduced and motivated by optimal transport theory. We prove that the resulting optimization problem is well-posed and establish several properties of the DDR method. We also show how the DDR method can be trained using a gradient-based optimization method, where the gradients are computed using the adjoint method from optimal control theory. The DDR method is implemented and compared on synthetic and example datasets to other dimension reductions methods, including PCA, t-SNE, and Umap.

We introduce a novel methodology for particle filtering in dynamical systems where the evolution of the signal of interest is described by a SDE and observations are collected instantaneously at prescribed time instants. The new approach includes the discretisation of the SDE and the design of efficient particle filters for the resulting discrete-time state-space model. The discretisation scheme converges with weak order 1 and it is devised to create a sequential dependence structure along the coordinates of the discrete-time state vector. We introduce a class of space-sequential particle filters that exploits this structure to improve performance when the system dimension is large. This is numerically illustrated by a set of computer simulations for a stochastic Lorenz 96 system with additive noise. The new space-sequential particle filters attain approximately constant estimation errors as the dimension of the Lorenz 96 system is increased, with a computational cost that increases polynomially, rather than exponentially, with the system dimension. Besides the new numerical scheme and particle filters, we provide in this paper a general framework for discrete-time filtering in continuous-time dynamical systems described by a SDE and instantaneous observations. Provided that the SDE is discretised using a weakly-convergent scheme, we prove that the marginal posterior laws of the resulting discrete-time state-space model converge to the posterior marginal posterior laws of the original continuous-time state-space model under a suitably defined metric. This result is general and not restricted to the numerical scheme or particle filters specifically studied in this manuscript.

Bearing fault identification and analysis is an important research area in the field of machinery fault diagnosis. Aiming at the common faults of rolling bearings, we propose a data-driven diagnostic algorithm based on the characteristics of bearing vibrations called multi-size kernel based adaptive convolutional neural network (MSKACNN). Using raw bearing vibration signals as the inputs, MSKACNN provides vibration feature learning and signal classification capabilities to identify and analyze bearing faults. Ball mixing is a ball bearing production quality problem that is difficult to identify using traditional frequency domain analysis methods since it requires high frequency resolutions of the measurement signals and results in a long analyzing time. The proposed MSKACNN is shown to improve the efficiency and accuracy of ball mixing diagnosis. To further demonstrate the effectiveness of MSKACNN in bearing fault identification, a bearing vibration data acquisition system was developed, and vibration signal acquisition was performed on rolling bearings under five different fault conditions including ball mixing. The resulting datasets were used to analyze the performance of our proposed model. To validate the adaptive ability of MSKACNN, fault test data from the Case Western Reserve University Bearing Data Center were also used. Test results show that MSKACNN can identify the different bearing conditions with high accuracy with high generalization ability. We presented an implementation of the MSKACNN as a lightweight module for a real-time bearing fault diagnosis system that is suitable for production.

We provide a new analysis of local SGD, removing unnecessary assumptions and elaborating on the difference between two data regimes: identical and heterogeneous. In both cases, we improve the existing theory and provide values of the optimal stepsize and optimal number of local iterations. Our bounds are based on a new notion of variance that is specific to local SGD methods with different data. The tightness of our results is guaranteed by recovering known statements when we plug $H=1$, where $H$ is the number of local steps. The empirical evidence further validates the severe impact of data heterogeneity on the performance of local SGD.

Deep learning techniques have received much attention in the area of image denoising. However, there are substantial differences in the various types of deep learning methods dealing with image denoising. Specifically, discriminative learning based on deep learning can ably address the issue of Gaussian noise. Optimization models based on deep learning are effective in estimating the real noise. However, there has thus far been little related research to summarize the different deep learning techniques for image denoising. In this paper, we offer a comparative study of deep techniques in image denoising. We first classify the deep convolutional neural networks (CNNs) for additive white noisy images; the deep CNNs for real noisy images; the deep CNNs for blind denoising and the deep CNNs for hybrid noisy images, which represents the combination of noisy, blurred and low-resolution images. Then, we analyze the motivations and principles of the different types of deep learning methods. Next, we compare the state-of-the-art methods on public denoising datasets in terms of quantitative and qualitative analysis. Finally, we point out some potential challenges and directions of future research.

The area of Data Analytics on graphs promises a paradigm shift as we approach information processing of classes of data, which are typically acquired on irregular but structured domains (social networks, various ad-hoc sensor networks). Yet, despite its long history, current approaches mostly focus on the optimization of graphs themselves, rather than on directly inferring learning strategies, such as detection, estimation, statistical and probabilistic inference, clustering and separation from signals and data acquired on graphs. To fill this void, we first revisit graph topologies from a Data Analytics point of view, and establish a taxonomy of graph networks through a linear algebraic formalism of graph topology (vertices, connections, directivity). This serves as a basis for spectral analysis of graphs, whereby the eigenvalues and eigenvectors of graph Laplacian and adjacency matrices are shown to convey physical meaning related to both graph topology and higher-order graph properties, such as cuts, walks, paths, and neighborhoods. Next, to illustrate estimation strategies performed on graph signals, spectral analysis of graphs is introduced through eigenanalysis of mathematical descriptors of graphs and in a generic way. Finally, a framework for vertex clustering and graph segmentation is established based on graph spectral representation (eigenanalysis) which illustrates the power of graphs in various data association tasks. The supporting examples demonstrate the promise of Graph Data Analytics in modeling structural and functional/semantic inferences. At the same time, Part I serves as a basis for Part II and Part III which deal with theory, methods and applications of processing Data on Graphs and Graph Topology Learning from data.

北京阿比特科技有限公司