Gradient Tensor - Search News

XDA Developers on MSN

I served a 200 billion parameter LLM from a Lenovo workstation the size of a Mac Mini

This mini PC is small and ridiculously powerful.

Smith, G.F. (1971) On Isotropic Functions of Symmetric Tensors, Skew-Symmetric Tensors and Vectors. International Journal of Engineering Science, 9, 899-916.

ABSTRACT: This paper presents derivation of micro and macro conservation and balance laws and the constitutive theories for the linear elastic micromorphic theory, in which elasticity is considered ...

GitHub

Do you use the full gradient tensor during LLM data selection?

Hello! Thanks for your great work! It is truly the best work I've come across in the field of data selection. I have a question about how the gradients of LLMs are handled in the method: When ...

EurekAlert!

Beating the AI bottleneck

Artificial intelligence (AI) is infamous for its resource-heavy training, but a new study may have found a solution in a novel communications system, called ZEN, that markedly improves the way large ...

IEEE

Hyperspectral Image Denoising and Destriping via Gradient Tensor Subspace Low-Rank Learning and Along-Across Stripe Directional Constraints

Abstract: This article proposes a new hyperspectral image (HSI) denoising and destriping method via gradient tensor subspace low-rank learning and along-across stripe directional constraints ...

PNAS

Sprain energy consequences for damage localization and fracture mechanics

Commonplace in practice, computational fracture predictions are often erroneous (e.g., for concrete, fiber-composites, geotechnics, earthquake, biomaterials). As recently revealed by the gap test, the ...

GitHub

output_hook is not storing tensors

Hi. thanks for your awesome source code. I'm training the model with your example code with l1 & l2 regularization. I tried to check the gradient flow of the tensor during the backward process.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results