顾美玲国产一区二区三区,无码人妻一区二区三区在线不卡,国产在线操大学生,人妻无码中文专区久久综合,看全色黄大色黄大片视频20

We propose a new way to assess certain short constructed responses to mathematics items. Our approach uses a pipeline that identifies the key values specified by the student in their response. This allows us to determine the correctness of the response, as well as identify any misconceptions. The information from the value identification pipeline can then be used to provide feedback to the teacher and student. The value identification pipeline consists of two fine-tuned language models. The first model determines if a value is implicit in the student response. The second model identifies where in the response the key value is specified. We consider both a generic model that can be used for any prompt and value, as well as models that are specific to each prompt and value. The value identification pipeline is a more accurate and informative way to assess short constructed responses than traditional rubric-based scoring. It can be used to provide more targeted feedback to students, which can help them improve their understanding of mathematics.

相關內容

可辨認的

關注 0

CASES · Analysis · Notability · 估計/估計量 · 向量化 ·

2023 年 10 月 10 日

An analysis on stochastic Lanczos quadrature with asymmetric quadrature nodes

Wenhao Li,Zongyuan Han,Yixuan Huang,Andy Wathen,Shengxin Zhu

from arxiv, 20 pages, 3 figures

The stochastic Lanczos quadrature method has garnered significant attention recently. Upon examination of the error analyses given by Ubaru, Chen and Saad and Cortinovis and Kressner, certain notable inconsistencies arise. It turns out that the former's results are valid for cases with symmetric quadrature nodes and may not be adequate for many practical cases such as estimating log determinant of matrices. This paper analyzes probabilistic error bound of the stochastic Lanczos quadrature method for cases with asymmetric quadrature nodes. Besides, an optimized error allocation technique is employed to minimize the overall number of matrix vector multiplications required by the stochastic Lanczos quadrature method.

Learning · 動力系統 · Automator · Continuity · 可理解性 ·

2023 年 10 月 10 日

A Bayesian framework for discovering interpretable Lagrangian of dynamical systems from data

Tapas Tripura,Souvik Chakraborty

Learning and predicting the dynamics of physical systems requires a profound understanding of the underlying physical laws. Recent works on learning physical laws involve generalizing the equation discovery frameworks to the discovery of Hamiltonian and Lagrangian of physical systems. While the existing methods parameterize the Lagrangian using neural networks, we propose an alternate framework for learning interpretable Lagrangian descriptions of physical systems from limited data using the sparse Bayesian approach. Unlike existing neural network-based approaches, the proposed approach (a) yields an interpretable description of Lagrangian, (b) exploits Bayesian learning to quantify the epistemic uncertainty due to limited data, (c) automates the distillation of Hamiltonian from the learned Lagrangian using Legendre transformation, and (d) provides ordinary (ODE) and partial differential equation (PDE) based descriptions of the observed systems. Six different examples involving both discrete and continuous system illustrates the efficacy of the proposed approach.

奇異的 · 泛函 · 近似 · 簇 · 模型評估 ·

2023 年 10 月 9 日

Resolution of singularities by rational functions

Astrid Herremans,Daan Huybrechs,Lloyd N. Trefethen

Results on the rational approximation of functions containing singularities are presented. We build further on the ''lightning method'', recently proposed by Trefethen and collaborators, based on exponentially clustering poles close to the singularities. Our results are obtained by augmenting the lightning approximation set with either a low-degree polynomial basis or poles clustering towards infinity, in order to obtain a robust approximation of the smooth behaviour of the function. This leads to a significant increase in the achievable accuracy as well as the convergence rate of the numerical scheme. For the approximation of $x^\alpha$ on $[0,1]$, the optimal convergence rate as shown by Stahl in 1993 is now achieved simply by least-squares fitting.

Analysis · 分段 · 論文 · 數值分析 ·

2023 年 10 月 9 日

Unifying a posteriori error analysis of five piecewise quadratic discretisations for the biharmonic equation

Carsten Carstensen,Benedikt Gr??le,Neela Nataraj

from arxiv, to be published in: Journal of Numerical Mathematics

An abstract property (H) is the key to a complete a priori error analysis in the (discrete) energy norm for several nonstandard finite element methods in the recent work [Lowest-order equivalent nonstandard finite element methods for biharmonic plates, Carstensen and Nataraj, M2AN, 2022]. This paper investigates the impact of (H) to the a posteriori error analysis and establishes known and novel explicit residual-based a posteriori error estimates. The abstract framework applies to Morley, two versions of discontinuous Galerkin, $C^0$ interior penalty, as well as weakly over-penalized symmetric interior penalty schemes for the biharmonic equation with a general source term in $H^{-2}(\Omega)$.

MoDELS · Learning · Performer · 語言模型化 · NLP ·

2023 年 10 月 9 日

Can language models learn analogical reasoning? Investigating training objectives and comparisons to human performance

Molly R. Petersen,Lonneke van der Plas

While analogies are a common way to evaluate word embeddings in NLP, it is also of interest to investigate whether or not analogical reasoning is a task in itself that can be learned. In this paper, we test several ways to learn basic analogical reasoning, specifically focusing on analogies that are more typical of what is used to evaluate analogical reasoning in humans than those in commonly used NLP benchmarks. Our experiments find that models are able to learn analogical reasoning, even with a small amount of data. We additionally compare our models to a dataset with a human baseline, and find that after training, models approach human performance.

MoDELS · Learning · Performer · Networking · Processing（編程語言） ·

2023 年 10 月 9 日

Meta predictive learning model of languages in neural circuits

Chan Li,Junbin Qiu,Haiping Huang

from arxiv, 31 pages, 6 figures, codes are available in the main text with the link

Large language models based on self-attention mechanisms have achieved astonishing performances not only in natural language itself, but also in a variety of tasks of different nature. However, regarding processing language, our human brain may not operate using the same principle. Then, a debate is established on the connection between brain computation and artificial self-supervision adopted in large language models. One of most influential hypothesis in brain computation is the predictive coding framework, which proposes to minimize the prediction error by local learning. However, the role of predictive coding and the associated credit assignment in language processing remains unknown. Here, we propose a mean-field learning model within the predictive coding framework, assuming that the synaptic weight of each connection follows a spike and slab distribution, and only the distribution, rather than specific weights, is trained. This meta predictive learning is successfully validated on classifying handwritten digits where pixels are input to the network in sequence, and moreover on the toy and real language corpus. Our model reveals that most of the connections become deterministic after learning, while the output connections have a higher level of variability. The performance of the resulting network ensemble changes continuously with data load, further improving with more training data, in analogy with the emergent behavior of large language models. Therefore, our model provides a starting point to investigate the connection among brain computation, next-token prediction and general intelligence.

ONCE · 約束 · GROUP · 示例 · Extensibility ·

2023 年 10 月 9 日

Coordination of multiple mobile manipulators for ordered sorting of cluttered objects

Jeeho Ahn,Seabin Lee,Changjoo Nam

from arxiv, Presented at iROS 2023

We present a coordination method for multiple mobile manipulators to sort objects in clutter. We consider the object rearrangement problem in which the objects must be sorted into different groups in a particular order. In clutter, the order constraints could not be easily satisfied since some objects occlude other objects so the occluded ones are not directly accessible to the robots. Those objects occluding others need to be moved more than once to make the occluded objects accessible. Such rearrangement problems fall into the class of nonmonotone rearrangement problems which are computationally intractable. While the nonmonotone problems with order constraints are harder, involving with multiple robots requires another computation for task allocation. The proposed method first finds a sequence of objects to be sorted using a search such that the order constraint in each group is satisfied. The search can solve nonmonotone instances that require temporal relocation of some objects to access the next object to be sorted. Once a complete sorting sequence is found, the objects in the sequence are assigned to multiple mobile manipulators using a greedy allocation method. We develop four versions of the method with different search strategies. In the experiments, we show that our method can find a sorting sequence quickly (e.g., 4.6 sec with 20 objects sorted into five groups) even though the solved instances include hard nonmonotone ones. The extensive tests and the experiments in simulation show the ability of the method to solve the real-world sorting problem using multiple mobile manipulators.

潛變量/隱變量 · MoDELS · 得分 · 潛在 · Learning ·

2023 年 10 月 7 日

Learning and scoring Gaussian latent variable causal models with unknown additive interventions

Armeen Taeb,Juan L. Gamella,Christina Heinze-Deml,Peter Bühlmann

With observational data alone, causal structure learning is a challenging problem. The task becomes easier when having access to data collected from perturbations of the underlying system, even when the nature of these is unknown. Existing methods either do not allow for the presence of latent variables or assume that these remain unperturbed. However, these assumptions are hard to justify if the nature of the perturbations is unknown. We provide results that enable scoring causal structures in the setting with additive, but unknown interventions. Specifically, we propose a maximum-likelihood estimator in a structural equation model that exploits system-wide invariances to output an equivalence class of causal structures from perturbation data. Furthermore, under certain structural assumptions on the population model, we provide a simple graphical characterization of all the DAGs in the interventional equivalence class. We illustrate the utility of our framework on synthetic data as well as real data involving California reservoirs and protein expressions. The software implementation is available as the Python package \emph{utlvce}.

離散化 · 有向 · 論文 · 相同 · 數學 ·

2023 年 10 月 6 日

Minors solve the elliptic curve discrete logarithm problem

Ansari Abdullah,Ayan Mahalanobis

The elliptic curve discrete logarithm problem is of fundamental importance in public-key cryptography. It is in use for a long time. Moreover, it is an interesting challenge in computational mathematics. Its solution is supposed to provide interesting research directions. In this paper, we explore ways to solve the elliptic curve discrete logarithm problem. Our results are mostly computational. However, it seems, the methods that we develop and directions that we pursue can provide a potent attack on this problem. This work follows our earlier work, where we tried to solve this problem by finding a zero minor in a matrix over the same finite field on which the elliptic curve is defined. This paper is self-contained.

過擬合 · SimPLe · Principle · 模型評估 · 統計量 ·

2021 年 3 月 16 日

Deep learning: a statistical viewpoint

Peter L. Bartlett,Andrea Montanari,Alexander Rakhlin

The remarkable practical success of deep learning has revealed some major surprises from a theoretical perspective. In particular, simple gradient methods easily find near-optimal solutions to non-convex optimization problems, and despite giving a near-perfect fit to training data without any explicit effort to control model complexity, these methods exhibit excellent predictive accuracy. We conjecture that specific principles underlie these phenomena: that overparametrization allows gradient methods to find interpolating solutions, that these methods implicitly impose regularization, and that overparametrization leads to benign overfitting. We survey recent theoretical progress that provides examples illustrating these principles in simpler settings. We first review classical uniform convergence results and why they fall short of explaining aspects of the behavior of deep learning methods. We give examples of implicit regularization in simple settings, where gradient methods lead to minimal norm functions that perfectly fit the training data. Then we review prediction methods that exhibit benign overfitting, focusing on regression problems with quadratic loss. For these methods, we can decompose the prediction rule into a simple component that is useful for prediction and a spiky component that is useful for overfitting but, in a favorable setting, does not harm prediction accuracy. We focus specifically on the linear regime for neural networks, where the network can be approximated by a linear model. In this regime, we demonstrate the success of gradient flow, and we consider benign overfitting with two-layer networks, giving an exact asymptotic analysis that precisely demonstrates the impact of overparametrization. We conclude by highlighting the key challenges that arise in extending these insights to realistic deep learning settings.