Large Language Models

Improving Data Efficiency for LLM Reinforcement Fine-tuning Through Difficulty-targeted Online Data Selection and Rollout Replay

Reinforcement learning (RL) has become an effective approach for fine-tuning large language models (LLMs), particularly to enhance their reasoning capabilities. However, RL …

yifan-sun

• Jun 10, 2025 • 1 min read

Bayesian Deep Learning

TokUR: Token-Level Uncertainty Estimation for Large Language Model Reasoning

While Large Language Models (LLMs) have demonstrated impressive capabilities, their output quality remains inconsistent across various application scenarios, making it difficult to …

tunyu-zhang

• May 16, 2025 • 1 min read

Bayesian Deep Learning

Training-Free Bayesianization for Low-Rank Adapters of Large Language Models

Advances in Neural Information Processing Systems (NeurIPS), 2025

haizhou-shi

• Dec 10, 2024 • 1 min read

Bayesian Deep Learning

BLoB: Bayesian Low-Rank Adaptation by Backpropagation for Large Language Models

Advances in Neural Information Processing Systems (NeurIPS), 2024

Yibin Wang

• Jun 17, 2024 • 1 min read

Large Language Models

Continual learning of large language models: A comprehensive survey

The challenge of effectively and efficiently adapting statically pre-trained Large Language Models (LLMs) to ever-evolving data distributions remains predominant. When tailored for …

haizhou-shi

• Apr 25, 2024 • 1 min read

