The extent to which a matching engine can cloud the modelling of underlying order submission and management processes in a financial market remains an open question in market modelling. Here we consider a 10-variate Hawkes process with simple rules to simulate common order types which are submitted to a matching engine. Hawkes processes can be used to model the times and order of events and how these events relate to each other, but they leave freedom in the implementation mechanics governing the prices and volumes of injected orders. This allows us to consider a reference Hawkes model and two additional models whose rules change the behaviour of limit orders. The resulting trade and quote data from the simulations are then calibrated and compared with the original order-generating process to determine the extent to which implementation rules can distort model parameters. Evidence from validation and hypothesis tests suggests that the true model specification can be significantly distorted by market mechanics, and that practical considerations not directly due to model specification can be important for model identification within an inherently asynchronous trading environment.
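As a concrete illustration of the kind of order-flow generator described above, here is a minimal sketch of simulating a multivariate Hawkes process by Ogata's thinning algorithm with exponential kernels. The 2-variate specification and all parameter values are assumptions for illustration only; the simulation in the abstract uses 10 event types feeding a matching engine.

```python
import numpy as np

rng = np.random.default_rng(0)

def simulate_hawkes(mu, alpha, beta, T):
    """Ogata thinning for a multivariate Hawkes process with exponential
    kernels: lambda_i(t) = mu_i + sum_j sum_{s in events_j, s < t}
    alpha[i, j] * exp(-beta * (t - s))."""
    d = len(mu)
    events = [[] for _ in range(d)]

    def intensities(t):
        return np.array([
            mu[i] + sum(alpha[i, j] * np.exp(-beta * (t - s))
                        for j in range(d) for s in events[j])
            for i in range(d)])

    t = 0.0
    while True:
        M = intensities(t).sum()       # valid bound: intensities only decay between events
        t += rng.exponential(1.0 / M)  # propose the next event time
        if t >= T:
            break
        lam = intensities(t)
        if rng.uniform() * M <= lam.sum():          # accept/reject (thinning)
            i = rng.choice(d, p=lam / lam.sum())    # attribute the event to a type
            events[i].append(t)
    return events

# Toy 2-variate specification; the paper's simulation uses 10 event types
# (e.g. market, limit and cancel orders on both sides of the book).
mu = np.array([0.2, 0.2])              # baseline intensities (assumed)
alpha = np.array([[0.3, 0.1],
                  [0.1, 0.3]])         # self/cross-excitation (assumed)
beta = 1.0                             # exponential decay rate (assumed)
events = simulate_hawkes(mu, alpha, beta, T=200.0)
print([len(e) for e in events])
```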
Discrete and especially binary random variables occur in many machine learning models, notably in variational autoencoders with binary latent states and in stochastic binary networks. When learning such models, a key tool is an estimator of the gradient of the expected loss with respect to the probabilities of the binary variables. The straight-through (ST) estimator has gained popularity due to its simplicity and efficiency, in particular in deep networks where unbiased estimators are impractical. Several techniques have been proposed to improve on ST while keeping the same low computational complexity: Gumbel-Softmax, ST-Gumbel-Softmax, BayesBiNN and FouST. We conduct a theoretical analysis of the bias and variance of these methods in order to understand their trade-offs and verify the originally claimed properties. The presented theoretical results allow for a better understanding of these methods and in some cases reveal serious issues.
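To make the object of study concrete, the following PyTorch sketch shows the straight-through trick for a single Bernoulli variable parameterised by a logit theta: hard 0/1 samples in the forward pass, identity with respect to the probability in the backward pass. The toy loss and sample count are illustrative assumptions, not taken from the paper.

```python
import torch

def st_bernoulli(p):
    """Straight-through estimator: hard 0/1 samples in the forward pass,
    gradients flow as if the sample were the probability p itself."""
    b = torch.bernoulli(p)
    return b.detach() + p - p.detach()

# Toy loss E[(b - 0.7)^2]; the ST gradient w.r.t. the logit is generally biased.
theta = torch.tensor(0.0, requires_grad=True)
p = torch.sigmoid(theta) * torch.ones(10_000)   # 10k parallel samples
loss = ((st_bernoulli(p) - 0.7) ** 2).mean()
loss.backward()
print(theta.grad)                               # ST gradient estimate
```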
We study the problem of estimating a rank-$1$ signal in the presence of rotationally invariant noise, a class of perturbations more general than Gaussian noise. Principal Component Analysis (PCA) provides a natural estimator, and sharp results on its performance have been obtained in the high-dimensional regime. Recently, an Approximate Message Passing (AMP) algorithm has been proposed as an alternative estimator with the potential to improve the accuracy of PCA. However, the existing analysis of AMP requires an initialization that is both correlated with the signal and independent of the noise, which is often unrealistic in practice. In this work, we combine the two methods and propose to initialize AMP with PCA. Our main result is a rigorous asymptotic characterization of the performance of this estimator. Both the AMP algorithm and its analysis differ from those previously derived in the Gaussian setting: at every iteration, our AMP algorithm requires a specific term to account for the PCA initialization, whereas in the Gaussian case the PCA initialization affects only the first iteration of AMP. The proof is based on a two-phase artificial AMP that first approximates the PCA estimator and then mimics the true AMP. Our numerical simulations show excellent agreement between AMP results and theoretical predictions, and suggest an interesting open direction towards achieving Bayes-optimal performance.
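The following NumPy sketch conveys the flavour of the approach in the simplest setting one can write down: a spiked Wigner model with Gaussian noise, PCA initialization, a generic tanh denoiser and the standard Onsager correction. It is not the paper's algorithm; for rotationally invariant noise the AMP iteration needs the additional PCA-correction term mentioned above, which is omitted here, and all numerical values are assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)
n, snr, iters = 1000, 2.0, 10

# Spiked Wigner model: Y = (snr / n) * x x^T + W with Gaussian W (an
# assumption; the paper treats general rotationally invariant W).
x = rng.choice([-1.0, 1.0], size=n)
W = rng.normal(size=(n, n)) / np.sqrt(n)
W = (W + W.T) / np.sqrt(2.0)
Y = (snr / n) * np.outer(x, x) + W

# PCA initialization: leading eigenvector of Y, rescaled to norm sqrt(n).
eigvecs = np.linalg.eigh(Y)[1]
m = eigvecs[:, -1] * np.sqrt(n)
print("PCA overlap:", abs(eigvecs[:, -1] @ x) / np.sqrt(n))

# Gaussian-noise AMP with a tanh denoiser and Onsager correction,
# started from the PCA estimate.
m_prev = np.zeros(n)
for _ in range(iters):
    fm = np.tanh(m)
    b = np.mean(1.0 - fm ** 2)                       # average derivative of tanh
    m, m_prev = Y @ fm - b * np.tanh(m_prev), m      # AMP update with Onsager term
print("AMP overlap:", abs(np.sign(m) @ x) / n)
```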
Data mining techniques offer great opportunities for developing ethics lines: tools for communication, participation and innovation whose main aim is to ensure improvement of, and compliance with, the values, conduct and commitments making up a code of ethics. The aim of this study is to propose a process for exploiting the data generated and collected by an ethics line by extracting association rules with the Apriori algorithm. This makes it possible to identify anomalies and behaviour patterns requiring action to review, correct, promote or expand them, as appropriate. Finally, I offer a simulated application of the Apriori algorithm, supplying it with synthetic data to explore its potential, strengths and limitations.
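As a sketch of the mining step, the self-contained Apriori implementation below extracts frequent itemsets from a handful of hypothetical ethics-line reports encoded as sets of categorical tags. The tags, support threshold and data are invented for illustration; association rules would then be derived from these itemsets by thresholding confidence.

```python
def apriori(transactions, min_support):
    """Minimal level-wise Apriori: returns frequent itemsets with support."""
    n = len(transactions)
    transactions = [frozenset(t) for t in transactions]
    current = [frozenset([i]) for i in {i for t in transactions for i in t}]
    frequent, k = {}, 1
    while current:
        counts = {c: sum(c <= t for t in transactions) for c in current}
        level = {c: cnt / n for c, cnt in counts.items() if cnt / n >= min_support}
        frequent.update(level)
        # Join step: candidate (k+1)-itemsets from frequent k-itemsets
        # (the subset-pruning step is omitted for brevity).
        keys = list(level)
        current = list({a | b for a in keys for b in keys if len(a | b) == k + 1})
        k += 1
    return frequent

# Hypothetical ethics-line reports encoded as sets of categorical tags.
reports = [
    {"harassment", "anonymous", "hr_dept"},
    {"harassment", "hr_dept"},
    {"fraud", "anonymous", "finance_dept"},
    {"fraud", "finance_dept"},
    {"harassment", "anonymous", "hr_dept"},
]
for itemset, support in sorted(apriori(reports, 0.4).items(), key=lambda kv: -kv[1]):
    print(set(itemset), round(support, 2))
```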
Along with confounding, selection bias is one of the fundamental threats to the validity of epidemiologic research. Unlike confounding, it has yet to be given a standard definition in terms of potential outcomes. Traditionally, selection bias has been defined as a systematic difference in a measure of the exposure-disease association between the study population and the population eligible for inclusion. This definition depends on the parameterization of the association between exposure and disease. The structural approach defines selection bias as a spurious exposure-disease association within the study population that occurs when selection is a collider or a descendant of a collider on a causal path from exposure to disease in the eligible population. This definition covers only selection bias that can occur under the null hypothesis. Here, we propose a definition of selection bias in terms of potential outcomes that identifies selection bias whenever disease risks and exposure prevalences are distorted by the selection of study participants, not just a given measure of association (as in the traditional approach) or all measures of association (as in the structural approach). This definition is nonparametric, so it can be analyzed using causal graphs both under and away from the null. It unifies the theoretical frameworks used to understand selection bias and confounding, explicitly links selection to the estimation of causal effects, distinguishes clearly between internal and external validity, and simplifies the analysis of complex study designs.
In visual interactive labeling, users iteratively assign labels to data items until the machine model reaches an acceptable accuracy. A crucial step of this process is to inspect the model's accuracy and decide whether it is necessary to label additional elements. In scenarios with no or very little labeled data, visual inspection of the predictions is required. Similarity-preserving scatterplots created with a dimensionality reduction algorithm are a common visualization used in these cases. Previous studies have investigated the effects of layout and image complexity on tasks such as labeling; however, model evaluation has not been studied systematically. We present the results of an experiment studying the influence of image complexity and visual grouping of images on model accuracy estimation. We found that users outperform traditional automated approaches when estimating a model's accuracy. Furthermore, while the complexity of images impacts overall performance, the layout of the items in the plot has little to no effect on estimates.
We present a general methodology to construct triplewise independent sequences of random variables having a common but arbitrary marginal distribution $F$ (satisfying very mild conditions). For two specific sequences, we obtain in closed form the asymptotic distribution of the sample mean. It is non-Gaussian (and depends on the specific choice of $F$). This allows us to illustrate the extent of the 'failure' of the classical central limit theorem (CLT) under triplewise independence. Our methodology is simple and can also be used to create, for any integer $K$, new $K$-tuplewise independent sequences that are not mutually independent. For $K \geq 4$, the sequences created with our methodology appear to satisfy a CLT, and we explain heuristically why this is the case.
We investigate the computational performance of Artificial Neural Networks (ANNs) in semi-nonparametric instrumental variables (NPIV) models with high-dimensional covariates that are relevant to empirical work in economics. We focus on efficient estimation of, and inference on, expectation functionals (such as weighted average derivatives) and use optimal criterion-based procedures (sieve minimum distance, or SMD) and novel efficient score-based procedures (ES), both of which use ANNs to approximate the unknown functions. We then provide a detailed practitioner's recipe for implementing these two classes of estimators, covering the choice of tuning parameters both for the unknown functions (which include conditional expectations) and for the estimation of the optimal weights in SMD and the Riesz representers used with the ES estimators. We conduct a large set of Monte Carlo experiments comparing finite-sample performance in complicated designs that involve a large set of regressors (up to 13 continuous) and various underlying nonlinearities and covariate correlations. Some takeaways from our results: 1) tuning and optimization are delicate, especially as the problem is nonconvex; 2) given proper tuning, ANN methods perform well, and the particular ANN architecture does not seem to matter for the designs we consider; 3) stable inferences are more difficult to achieve with ANN estimators; 4) optimal SMD-based estimators perform adequately; 5) there seems to be a gap between implementation and approximation theory. Finally, we apply ANN NPIV to estimate average price elasticities and average derivatives in two demand examples.
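To fix ideas, here is a stylized PyTorch sketch of a sieve-minimum-distance-type NPIV estimator with an ANN for the structural function and identity weighting. The simulated design, network size, instrument basis and optimizer settings are all assumptions, and the optimal weighting, efficient-score construction and inference on functionals discussed above are omitted.

```python
import math
import torch
import torch.nn as nn

torch.manual_seed(0)

# Simulated NPIV design (an assumption, not one of the paper's designs):
# instrument W, endogenous regressor X, structural function h0(x) = sin(pi x).
n = 1000
W = torch.rand(n, 1) * 2 - 1
U = torch.randn(n, 1)
X = 0.7 * W + 0.3 * U                          # endogenous: correlated with U
Y = torch.sin(math.pi * X) + U + 0.1 * torch.randn(n, 1)

# Instrument sieve: polynomial basis in W; P_B projects onto its span, so
# (Y - h(X))' P_B (Y - h(X)) / n is an identity-weighted SMD criterion.
B = torch.cat([W ** k for k in range(5)], dim=1)
P_B = B @ torch.linalg.solve(B.T @ B, B.T)

h = nn.Sequential(nn.Linear(1, 32), nn.Tanh(), nn.Linear(32, 1))
opt = torch.optim.Adam(h.parameters(), lr=1e-2)

for _ in range(500):
    resid = Y - h(X)
    loss = (resid * (P_B @ resid)).sum() / n   # SMD criterion with identity weights
    opt.zero_grad()
    loss.backward()
    opt.step()

# Compare the fitted h with the true structural function on a small grid.
x_grid = torch.linspace(-1.0, 1.0, 5).reshape(-1, 1)
with torch.no_grad():
    print(torch.cat([x_grid, torch.sin(math.pi * x_grid), h(x_grid)], dim=1))
```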
Sequential Multiple-Assignment Randomized Trials (SMARTs) play an increasingly important role in psychological and behavioral health research. This experimental approach enables researchers to answer scientific questions about how best to sequence and match interventions to the unique and changing needs of individuals. A variety of sample size calculations have been developed in recent years, enabling researchers to plan SMARTs that address different types of scientific questions. However, relatively limited attention has been given to planning SMARTs with binary (dichotomous) outcomes, which often require larger sample sizes than continuous outcomes. Existing resources for estimating sample size requirements for SMARTs with binary outcomes do not consider the potential to improve power by including a baseline measurement and/or multiple longitudinal measurements. The current paper addresses this issue by providing sample size formulas for longitudinal binary outcomes and exploring their performance via simulation.
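For orientation on the binary-outcome piece only, the snippet below computes a per-arm sample size from the standard two-proportion normal-approximation formula. It is not one of the SMART-specific formulas developed in the paper (which account for sequential randomization, a baseline measurement and repeated measurements), and the response probabilities are hypothetical.

```python
import math
from scipy.stats import norm

def two_arm_binary_n(p1, p2, alpha=0.05, power=0.80):
    """Per-arm n for comparing two proportions (normal approximation)."""
    z_a = norm.ppf(1 - alpha / 2)
    z_b = norm.ppf(power)
    var = p1 * (1 - p1) + p2 * (1 - p2)
    return math.ceil((z_a + z_b) ** 2 * var / (p1 - p2) ** 2)

# Hypothetical end-of-study response probabilities under two embedded regimes.
print(two_arm_binary_n(0.40, 0.55))
```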
To solve complex real-world problems with reinforcement learning, we cannot rely on manually specified reward functions. Instead, we can have humans communicate an objective to the agent directly. In this work, we combine two approaches to learning from human feedback: expert demonstrations and trajectory preferences. We train a deep neural network to model the reward function and use its predicted reward to train a DQN-based deep reinforcement learning agent on 9 Atari games. Our approach beats the imitation learning baseline in 7 games and achieves strictly superhuman performance on 2 games without using game rewards. Additionally, we investigate the goodness of fit of the reward model, present some reward hacking problems, and study the effects of noise in the human labels.
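A minimal PyTorch sketch of the preference part of such a pipeline, under the usual Bradley-Terry assumption that the probability a human prefers one trajectory segment is a logistic function of the difference in summed predicted rewards. The MLP architecture, segment length and observation dimension are toy assumptions standing in for the convolutional model used on Atari frames.

```python
import torch
import torch.nn as nn

class RewardModel(nn.Module):
    """Toy per-step reward model over flat observation vectors."""
    def __init__(self, obs_dim):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(obs_dim, 64), nn.ReLU(), nn.Linear(64, 1))

    def forward(self, obs):                      # obs: (steps, obs_dim)
        return self.net(obs).squeeze(-1)         # per-step predicted reward

def preference_loss(model, seg_a, seg_b, pref):
    """Bradley-Terry loss: pref = 1.0 if the human preferred segment A, else 0.0."""
    logit = model(seg_a).sum() - model(seg_b).sum()
    return nn.functional.binary_cross_entropy_with_logits(logit, torch.tensor(pref))

# One gradient step on a random preference pair (10 steps, 4-dim observations).
model = RewardModel(obs_dim=4)
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
seg_a, seg_b = torch.randn(10, 4), torch.randn(10, 4)
loss = preference_loss(model, seg_a, seg_b, pref=1.0)
opt.zero_grad()
loss.backward()
opt.step()
print(loss.item())
```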
Discrete random structures are important tools in Bayesian nonparametrics, and the resulting models have proven effective in density estimation, clustering, topic modeling and prediction, among other tasks. In this paper, we consider nested processes and study the dependence structures they induce. Dependence ranges between homogeneity, corresponding to full exchangeability, and maximum heterogeneity, corresponding to (unconditional) independence across samples. The popular nested Dirichlet process is shown to degenerate to the fully exchangeable case when there are ties across samples at the observed or latent level. To overcome this drawback, which is inherent to nesting general discrete random measures, we introduce a novel class of latent nested processes. These are obtained by adding common and group-specific completely random measures and then normalising to yield dependent random probability measures. We provide results on the partition distributions induced by latent nested processes and develop a Markov chain Monte Carlo sampler for Bayesian inference. A test for distributional homogeneity across groups is obtained as a by-product. The results and their inferential implications are showcased on synthetic and real data.
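The construction can be sketched numerically with a crude finite approximation: approximate the common and group-specific completely random measures by gamma CRMs with finitely many atoms, add them, and normalise to obtain dependent random probability measures. The gamma choice, masses and atom counts below are illustrative assumptions, not the paper's specification.

```python
import numpy as np

rng = np.random.default_rng(2)

def gamma_crm(total_mass, n_atoms):
    """Finite approximation of a gamma CRM on [0, 1]: i.i.d. uniform atom
    locations with Gamma(total_mass / n_atoms, 1) jumps."""
    return rng.uniform(size=n_atoms), rng.gamma(total_mass / n_atoms, 1.0, size=n_atoms)

# One common measure shared by all groups plus one idiosyncratic measure per
# group; normalising each sum yields dependent random probability measures.
n_atoms, n_groups = 200, 2
common_locs, common_jumps = gamma_crm(total_mass=1.0, n_atoms=n_atoms)
groups = []
for _ in range(n_groups):
    own_locs, own_jumps = gamma_crm(total_mass=1.0, n_atoms=n_atoms)
    locs = np.concatenate([common_locs, own_locs])
    jumps = np.concatenate([common_jumps, own_jumps])
    groups.append((locs, jumps / jumps.sum()))

# Shared atoms can be drawn in every group, inducing ties across samples
# without forcing full exchangeability.
for g, (locs, probs) in enumerate(groups):
    print(f"group {g}:", np.round(rng.choice(locs, size=5, p=probs), 3))
```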