Linear Gradient CSS Codings

138 NYC Schools That Are Defying Expectations When it Comes to Reading

Aldeman: New York City families have a wide variety of district and charter schools where literacy rates are soaring despite ...

GitHub

What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective (ACL'25 Oral)

This is the repo for the Layer_Gradient project, in which we try to understand the layer-wise gradient behaviors when LLMs are finetuned on Fast vs. Slow Thinking. What makes a difference in the ...

GitHub

GReaTer - Gradient Over Reasoning makes Smaller Language Models Strong Prompt Optimizers

In this paper, we propose a new technique of prompt optimization with smaller language models, called GReaTer that leverages gradient information over task-specific reasoning. Compared to text-based ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

138 NYC Schools That Are Defying Expectations When it Comes to Reading

What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective (ACL'25 Oral)

GReaTer - Gradient Over Reasoning makes Smaller Language Models Strong Prompt Optimizers

Trending now