Contents
대규모 언어모델을 위한 연속 학습의 효율적 적용에 관한 연구
1. 서론
대규모 언어모델(LLM)의 발전과 함께, 이런 모델들의 지속적인 학습 및 업데이트의 필요성이 증가하고 있다. 특히, 기존 정보의 정확성을 유지하면서 새로운 데이터를 효과적으로 통합하는 연속 학습의 중요성이 강조되고 있다. 본 연구에서는 LLM에 특화된 새로운 연속 학습 아키텍처를 제안하며, 이를 통해 모델의 학습 효율성과 정확성을 동시에 개선할 수 있는 방법을 탐구한다.
2. 연구 배경 및 관련 이론
2.1 대규모 언어모델의 연속 학습
연속 학습은 기존의 지식을 유지하면서 지속적으로 새로운 정보를 학습하는 프로세스를 말한다. 이는 모델이 새로운 데이터에 노출되었을 때 기존의 정보를 잊어버리는 ‘급격한 망각(catastrophic forgetting)’ 문제를 해결하는 데 중요하다. 연속 학습의 핵심은 다음과 같은 수학적 접근을 통해 이루어진다.
\[\theta_{t+1} = \theta_t - \eta \nabla L(D_{new}, \theta_t)\]\(\theta_t\)는 시간 \(t\)에서의 모델 파라미터, \(\eta\)는 학습률, \(L\)은 손실 함수, \(D_{new}\)는 새로운 데이터셋을 의미한다. 이 과정은 이전 데이터에 대한 지식을 유지하면서 새로운 데이터를 효과적으로 통합하기 위해 고안된다.
2.2 벤치마크 및 데이터셋
연구에서 사용된 주요 벤치마크는 SuperGLUE, TydiQA 등이다. 이 벤치마크들은 언어 이해, 응답 생성 등 다양한 NLP 태스크를 포함하며, 모델의 다양한 능력을 평가하는 데 활용된다. 데이터셋은 연속적으로 변화하는 세계적인 사건과 정보를 반영하기 위해 지속적으로 업데이트되며, 모델이 최신 지식을 습득하고 적용할 수 있도록 지원한다.
3. 방법
3.1 모델 아키텍처
제안된 연속 학습 아키텍처는 기존 LLM의 구조를 확장하여, 새로운 정보를 더욱 효율적으로 통합할 수 있도록 설계되었다. 이 아키텍처는 다음과 같은 수학적 구조를 기반으로 한다.
\[\text{Loss} = \sum_{i=1}^n L(y_i, f(x_i; \theta))\]\(L\)은 손실 함수, \(y_i\)는 실제 레이블, \(x_i\)는 입력 데이터, \(f\)는 모델 함수, \(\theta\)는 모델 파라미터를 나타낸다. 이 구조는 새로운 정보를 받아들이면서도 이전에 학습한 정보를 잊지 않도록 돕는다.
3.2 학습 알고리즘
연속 학습을 위한 학습 알고리즘은 다음과 같은 과정을 포함한다.
이 알고리즘은 수학적 최적화 이론과 통계적 학습 이론을 기반으로 하며, 모델의 지속적인 성능 개선을 도모한다.
4. 실험 및 결과
4.1 실험 설정
모델은 여러 NLP 태스크를 포함하는 벤치마크 데이터셋을 사용하여 평가되었다. 실험은 다양한 설정 하에서 수행되었으며, 모델의 학습률, 망각 방지 기술의 효
과 등이 분석되었다.
4.2 결과 분석
실험 결과, 제안된 모델과 학습 방법은 기존 방법에 비해 유의미한 성능 개선을 보였다. 특히, 새로운 데이터에 대한 빠른 적응력과 함께 이전 데이터에 대한 지식 보존 능력이 향상되었다는 점이 입증되었다.
5. 결론 및 향후 연구 방향
이 연구는 대규모 언어모델의 연속 학습을 위한 새로운 아키텍처를 제안하고, 이를 다양한 데이터셋과 벤치마크를 통해 검증하였다. 연구 결과는 모델의 학습 효율성 및 정확성을 동시에 개선할 수 있는 가능성을 보여준다. 향후 연구에서는 더욱 다양한 데이터와 복잡한 태스크에서의 모델 성능을 검증하고, 연속 학습 기술의 최적화를 추구할 예정이다.
Recent years have witnessed the rapid advances of large language models’ (LLMs) capabilities in solving a diverse range of problems. At the same time, it is vital for LLMs to be regularly updated to accurately reflect the ever-evolving human knowledge, values and linguistic patterns, calling for the investigation of continual learning for LLMs. Whilst continual learning bears some resemblance to other strategies for model improvements, such as retrieval-augmented generation (RAG) [Lewis et al., 2020] and model editing [Yao et al., 2023], their main purposes differ (Table 1). Unlike these strategies, whose primarily focus is on refining the domainspecific accuracy or expanding the model’s factual knowledge base, continual learning aims to enhance the overall linguistic and reasoning capabilities of LLMs. This distinction is crucial as it shifts the focus from merely updating information to developing a model’s ability to process and generate language in a more comprehensive and nuanced manner [Zhang et al., 2023d].
Continual learning for LLMs also differs from its use in smaller models, including smaller pre-trained language models (PLMs). Due to their vast size and complexity, LLMs require a multi-faceted approach to continual learning. We categorise it into three different stages, i.e. continual pretraining to expand the model’s fundamental understanding of language [Jin et al., 2022], continual instruction tuning to improve the model’s response to specific user commands [Zhang et al., 2023e], and continual alignment to ensure the model’s outputs adhere to values, ethical standards and societal norms [Zhang et al., 2023a]. This multi-stage process is distinct from the more linear adaptation strategies used in smaller models, as illustrated in Figure 1, highlighting the unique challenges and requirements of applying continual learning to LLMs.
Figure 1: Continual learning for large language models involves hybrid multi-stage training with multiple training objectives.
This survey differentiates itself from previous studies by its unique focus and structure. While previous surveys in the field are typically organized around various continual learning strategies [Biesialska et al., 2020], ours is the first to specifically address continual learning in the context of LLMs. We structure our analysis around the types of information that is updated continually and the distinct stages of learning involved in LLMs. This survey offers a detailed and novel perspective on how continual learning is applied to LLMs, shedding light on the specific challenges and opportunities of this application. Our goal is to provide a thorough understanding of the effective implementation of continual learning in LLMs, contributing to the development of more advanced and adaptable language models in the future.
Figure 2: The continual learning of LLMs involves multi-stage and cross-stage iteration, which may lead to substantial forgetting problems. For example, when the instruction-tuned model resumes continual pre-training, it may encounter cross-stage forgetting, resulting in reduced performance on instruction-following tasks.
Table 1: Continual Learning v.s. RAG and Model Editing
Large language models (LLMs) like ChatGPT1 and LLaMA [Touvron et al., 2023] have shown superior performance in many tasks. They are usually trained in multiple stages, including pre-training, instruction tuning, and alignment, as ilIn the pre-training stage, LLMs are lustrated in Figure 1. trained on a large corpus in a self-supervised manner [Dong et al., 2019], where the training text is randomly masked and the LLMs are asked to predict the masked tokens. In the instruction tuning stage, LLMs are fine-tuned on a set of instructionoutput pairs in a supervised fashion [Zhang et al., 2023b]. Given a task-specific instruction as input, LLMs are asked to In the alignment stage, generate the corresponding output. LLMs are further finetuned with human feedback to align their outputs with human expectations [Wang et al., 2023c]. The output of LLMs is scored by human annotators, and the LLMs are updated to generate more human-like responses.
Continual learning focuses on developing learning algorithms to accumulate knowledge on non-stationary data, often delineated by classes, tasks, domains or instances. In supervised continual learning, a sequence of tasks {D1, . . . , DT } arrive int in a streaming fashion. Each task Dt = {xt i=1} contains a separate target dataset, where xt i ∈ Yt. A single model needs to adapt to them sequentially, with only access to Dt at the t-th task. This setting requires models to acquire, update, accumulate, and exploit knowledge throughout their lifetime [Biesialska et al., 2020].
The major challenge conventional continuous learning tackles is that of catastrophic forgetting, where the performance of a model on old tasks significantly diminishes when trained with new data. Existing studies can be roughly grouped into three categories, e.g., experience replay methods [Chaudhry et al., 2019; Wu et al., 2021], regularizationbased methods [Kirkpatrick et al., 2017; Chen et al., 2023b], and dynamic architecture methods [Mallya et al., 2018]. Recently, researchers have designed some hybrid methods that take advantage of the aforementioned techniques [Chen et al., 2023a; He et al., 2024].
Continual Learning for Large Language Models aims to enable LLMs to learn from a continuous data stream over time. Despite the importance, it is non-trivial to directly apply existing continual learning settings for LLMs. We now provide a forward-looking framework of continual learning for LLMs, then present a categorization of research in this area.
Framework Our learning for LLMs is illustrated in Figure 2. W align continual learning for LLMs with the different training stages, including Continual Pre-training (CPT), Continual Instruction Tuning (CIT), and Continual Alignment (CA). The Continual Pre-training stage aims to conduct training on a sequence of corpus selfsupervisedly to enrich LLMs’ knowledge and adapt to new domains. The Continual Instruction Tuning stage finetunes LLMs on a stream of supervised instruction-following data, aiming to empower LLMs to follow users’ instructions while transferring acquired knowledge for subsequent tasks. Responding to the evolving nature of human values and preferences, Continual Alignment (CA) tries to continuously align LLMs with human values over time.
While continual learning on LLMs can be conducted in each stage sequentially, the iterative application of continual learning also makes it essential to transfer across stages without forgetting the ability and knowledge learned from previous stages. For instance, we can conduct continual pretraining based on either instruction-tuned models or aligned models. However, we do not want the LLM to lose their ability to follow users’ instructions and align with human values. Therefore, as shown in Figure 2, we use arrows with different colors to show the transfer between stages.
Categorization To better understand the research in this area, we provide a fine-grained categorization for each stage of the framework.
Continual Pre-training (CPT)
Continual Instruction Tuning (CIT)
Continual Alignment (CA)
Besides categorizing methods based on training stages, we also provide an alternative categorization based on the information updated during continual learning. In Table 2, we list some representative information that is updated for LLMs, e.g., facts, domains, tasks, values, and preferences. Based on the training objectives of LLMs, this information can be updated in different stages of LLM continual learning. The taxonomy in Figure 3 shows our categorization scheme and recent representative work in each category.
Continual pretraining in large language models is essential for keeping the LLMs relevant and effective. This process involves regularly updating the models with the latest information [Jang et al., 2022a], adapting them to specialized domains [Ke et al., 2023], enhancing their coding capabilities [Yadav et al., 2023], and expanding their linguistic range [Castellucci et al., 2021]. With CPT, LLMs can stay current with new developments, adapt to evolving user needs, and remain effective across diverse applications. Continual pretraining ensures LLMs are not just knowledgeable but also adaptable and responsive to the changing world.
Table 2: Information updated during different stages of continual learning for LLMs.
The capability of LLMs to integrate and adapt to recent information is crucial. A pivotal strategy here is the employment of dynamic datasets that facilitate the real-time assimilation of data from a variety of sources like news feeds [Sun et al., 2020], scholarly articles [Cossu et al., 2022], and social media [Cossu et al., 2022]. [Sun et al., 2020] presents ERNIE 2.0, which is a continual pre-training framework that incrementally builds and learns from multiple tasks to maximize knowledge extraction from training data. [Jang et al., 2022b] introduces continual knowledge learning, a method for updating temporal knowledge in LLMs, reducing forgetting while acquiring new information. [Jang et al., 2022a] shows that continual learning with different data achieves comparable or better perplexity in language models than training on the entire snapshot, confirming that factual knowledge in LMs can be updated efficiently with minimal training data. Integral to this process is the implementation of automated systems for the verification of newly acquired data, ensuring both the accuracy and dependability of the information.
Continual pre-training updates domain knowledge through two approaches: 1) domain-incremental pre-training accumulates knowledge across multiple domains, and 2) domainspecific continual learning, which evolves a general model into a domain expert by training on domain-specific datasets and tasks. In domain-incremental pre-training, [Cossu et al., 2022] explores how models can be continually pre-trained on new data streams for both language and vision, preparing them for various downstream tasks. [Qin et al., 2023b] examines continual retraining by assessing model compatibility and benefits of recyclable tuning via parameter initial[Ke et al., 2023] introization and knowledge distillation. duces a soft-masking mechanism to update language models (LMs) with domain corpora, aiming to boost performance while preserving general knowledge. For domain-specific continual learning, [Xie et al., 2023] develops FinPythia-6.9B through domain-adaptive pre-training for the financial sector. EcomGPT-CT [Ma et al., 2023] investigates the effects of continual pre-training in the E-commerce domain. These studies collectively highlight the evolving landscape of continual pre-training, demonstrating its effectiveness in enhancing model adaptability and expertise across a wide range of domains.
Expanding the range of languages that LLMs can understand and process is essential for ensuring broader accessibility [Castellucci et al., 2021]. This expansion is not just about including a wider variety of languages, particularly underrepresented ones, but also about embedding cultural contexts into language processing. A significant challenge here is the model’s ability to recognize and interpret regional dialects and contemporary slangs [Gogoulou et al., 2023], which is crucial for effective and relevant communication across diverse racial, social and cultural groups.
In addition to mastering natural languages, LLMs have also made significant strides in understanding and generating programming languages. [Yadav et al., 2023] introduced CodeTask-CL, a benchmark for continual code learning that encompasses a diverse array of tasks, featuring various input and output formats across different programming languages. [Zan et al., 2022] explore using an unlabeled code corpus for training models on library-oriented code generation, addressing the challenge of scarce text-code pairs due to extensive library reuse by programmers. They introduce CERT, a method where a ”sketcher” outlines a code structure, and a ”generator” completes it, both continuously pre-trained on unlabeled data to capture common patterns in library-focused code snippets. These developments highlight LLMs’ potential to transform both natural and programming language processing, leading to more efficient coding practices.
LLMs have shown great instruction following abilities that can be used to complete different tasks with a few-shot task prompt. Continual Instruction Tuning (CIT) involves continually fine-tuning the LLMs to learn how to follow instructions and transfer knowledge for future tasks [Zhang et al., 2023e]. Based on the ability and knowledge updated during instruction tuning, we can further divide CIT into three categories: 1) task-incremental CIT, 2) domain-incremental CIT, and tool-incremental CIT.
CIT Task-incremental Continual Instruction Tuning (Taskincremental CIT) aims to continuously finetune LLMs on a sequence of task-specific instructions and acquire the ability to solve novel tasks. A straightforward solution is to continuously generate instruction-tuning data for new tasks and directly fine-tune LLMs on it [Wang et al., 2023b]. However, studies have shown that continuously fine-tuning LLMs on task-specific data would cause a catastrophic forgetting of the learned knowledge and problem-solving skills in previous tasks [Kotha et al., 2023]. TAPT [Gururangan et al., 2020] presents a simple data selection strategy that retrieves unlabeled text from the in-domain corpus, aligning it with the task distribution. This retrieved text is then utilized to finetune LLMs, preventing catastrophic forgetting and enhancing argument performance. To mitigate catastrophic forgetting, Contunual-T0 [Scialom et al., 2022] employs rehearsal with a memory buffer [Shin et al., 2017] to store previous tasks data and replay them during training. ConTinTin [Yin et al., 2022] presents InstructionSpeak, which includes two strategies that make full use of task instructions to improve forward-transfer and backward-transfer. The first strategy involves learning from negative outputs, while the second strategy focuses on revisiting instructions from previous tasks. RationaleCL [Xiong et al., 2023] conducts contrastive rationale replay to alleviate catastrophic forgetting. DynaInst [Mok et al., 2023] proposes a hybrid approach incorporating a Dynamic Instruction Replay and a local minima-inducing regularizer. These two components enhance the generalizability of LLMs and decrease memory and computation usage in the replay module. Unlike previous replay-based or regularization-based methods, SLM [Anonymous, 2024b] incorporates vector space retrieval into the language model, which aids in achieving scalable knowledge expansion and management. This enables LLMs’ quick adaptation to novel tasks without compromising performance caused by catastrophic forgetting.
LLMs with billions of parameters introduce a huge computational burden for conducting continual learning. To address this issue, the Progressive Prompts technique [Razdaibiedina et al., 2023] freezes the majority of parameters and only learns a fixed number of tokens (prompts) for each new task. Progressive Prompts significantly reduce the computational cost while alleviating catastrophic forgetting and improving the transfer of knowledge to future tasks. ELM [Jang et al., 2023] first trains a small expert adapter on top of the LLM for each task. Then, it employs a retrieval-based approach to choose the most pertinent expert LLM for every new task. Based on the parameter-efficient tuning (PET) framework, OLoRA [Wang et al., 2023a] proposes an orthogonal low-rank adaptation for CIT. O-LoRA incrementally learns new tasks in an orthogonal subspace while fixing the LoRA parameters learned from past tasks to minimize catastrophic forgetting. Similarly, DAPT [Zhao et al., 2024] proposes a novel Dual Attention Framework to align the learning and selection of LoRA parameters via the Dual Attentive Learning&Selection module. LLaMA PRO [Wu et al., 2024] proposes a novel block expansion technique, which enables the injection of new knowledge into LLMs and preserves the initial capabilities with efficient post-training.
Continual Instruction Tuning (Domainincremental CIT) aims to continually finetune LLMs on a sequence of domain-specific instructions and acquire the knowledge to solve tasks in novel domains. TAPT [Gururangan et al., 2020] adaptively tunes the LLMs on a series of domain-specific data including biomedicine, computer science, news, and shopping reviews. Then, it evaluates the LLMs’ text classification ability in each domain. ConPET [Song et al., 2023] applies previous continual learning methods, initially developed for smaller models, to LLMs using PET and a dynamic replay strategy. This approach significantly reduces tuning costs and mitigates overfitting and forgetting problems. Experiments conducted on a typical continual learning scenario, where new knowledge types gradually emerge, demonstrate the superior performance of ConPET. AdaptLLM [Cheng et al., 2023a] adapts LLMs to different domains by enriching the raw training corpus into a series of reading comprehension tasks relevant to its content. These tasks are designed to help the model leverage domain-specific knowledge while enhancing prompting performance. PlugLM [Cheng et al., 2023b] uses a differentiable plug-in memory (DPM) to explicitly store the domain knowledge. PlugLM could be easily adapted to different domains by plugging in in-domain memory. [Zhang et al., 2023c] designs an adapt-retrieve-revise process that adapts LLMs to new domains. It first uses the initial LLMs’ respose to re-
trieve knowledge from the domain database. The retrieved knowledge is used to revise initial responses and obtain final answers. [Dong et al., 2023] analyze the LLMs continuously tuned on different domains and find that the sequence of training data has a significant impact on the performance of LLMs. They also offer a Mixed Fine-tuning (DMT) strategy to learn multiple abilities in different domains.
Tool-incremental Continual Instruction Tuning (Toolincremental CIT) aims to fine-tune LLMs continuously, enabling them to interact with the real world and enhance their abilities by integrating with tools, such as calculators, search engines, and databases [Qin et al., 2023a]. With the rapid emergence of new tools like advanced software libraries, novel APIs, or domain-specific utilities [Liang et al., 2023; Jin et al., 2023], there is a growing need to continually update LLMs so they can quickly adapt and master these new tools. Llemma [Azerbayev et al., 2023] continues tuning LLMs on a dataset with a mixture of math-related text and code to enable LLMs to solve mathematical problems by using external tools. ToolkenGPT [Hao et al., 2023] represents each tool as a new token (toolken) whose embedding is learned during instruction tuning. This approach offers an efficient way for LLMs to master tools and swiftly adapt to new tools by adding additional tokens.
LLMs need to adapt to evolving societal values, social norms and ethical guidelines. Furthermore, there exists substantial diversity in preferences across different demographic groups, as well as individuals’ changing preferences over time. The need to respond to these changes give rise to continual alignment. In the context of continual alignment, two scenarios emerge: (i) the requirement to update LLMs to reflect shifts in societal values and (ii) integrating new demographic groups or value types to existing LLMs, which we will describe in the following subsections.
Continual value alignment aims to continually incorporate ethical guidelines or adapt to cultural sensitivities and norms. It requires updating to unlearn outdated notions and incorporating new values, akin to model editing and unlearning tasks. Model editing and knowledge unlearning have been studied in pretraining and instruction tuning phases [Yao et al., 2023]; however, they have not yet been explored in preference learning.
Adding new demographic groups or value types aligns with continual learning problems, aiming to guide LLMs in generating responses aligned with emerging values while adhering to previously learned ones. For example, many opensource aligned LLMs employ reinforcement learning with human feedback (RLHF) for safety. We may want to align the LLMs for additional attributes such as helpfulness and faithfulness. Beyond the challenge of retaining past preferences while maximising the reward on new ones, continual preference learning also faces difficulties in stable and efficient training with a large action space (vocabulary) and a large number of parameters. Previous works have demonstrated proof-of-concept of such agents. However, there is a lack of standardized benchmarks to systematically evaluate the learning capabilities of new preferences over time. Continual Proximal Policy Optimization (CPPO) [Anonymous, 2024a] utilizes a sample-wise weighting on the Proximal Policy Optimization (PPO) algorithm [Schulman et al., 2017] to balance policy learning and knowledge retention in imitating the old policy output. On the other hand, [Zhang et al., 2023a] extend the Direct Preference Optimization (DPO) algorithm [Rafailov et al., 2023] to the continual learning setting by employing Monte Carlo estimation to derive a sequence of optimal policies for the given sequences of tasks and incorporate them to regularize the policy learning on new tasks.
The systematic evaluation of LLMs’ continual learning performance demands benchmarks with high-quality data sources and diverse content. Below we summarize notable benchmark dataets.
TemporalWiki [Jang et al., 2022a] serves as a lifelong benchmark, training and evaluating Language Models using consecutive snapshots of Wikipedia and Wikidata, helping assess an LM’s ability to retain past knowledge and acquire new knowledge over time. Additional social media datasets like Firehose [Hu et al., 2023] comprise 100 million tweets from one million users over six years. CKL [Jang et al., 2022b] focuses on web and news data, aiming to retain time-invariant world knowledge from initial pretraining while efficiently learning new knowledge through continued pre-training on different corpora. TRACE [Wang et al., 2023b] encompasses eight diverse datasets covering specialized domains, multilingual tasks, code generation, and mathematical reasoning. These datasets are harmonized into a standard format, facilitating straightforward and automated evaluation of LLMs. Due to the fast-paced nature of data, time-sensitive datasets quickly become outdated, necessitating frequent updates to continual pre-training benchmarks for model evaluation.
The Continual Instruction Tuning Benchmark (CITB) [Zhang et al., 2023e] is based on SuperNI, encompassing over 1,600 Natural Language Processing (NLP) tasks across 76 types like language generation and classification, all in a text-totext format. ConTinTin [Yin et al., 2022], another benchmark derived from NATURAL-INSTRUCTIONS, includes 61 tasks across six categories, such as question generation and classification. When using these benchmarks for evaluating black-box language learning models that cannot access their training data, the selection of datasets is crucial to avoid task contamination and ensure reliable performance assessment in continual instruction tuning.
COPF [Zhang et al., 2023a] conduct experiments for continual alignment using datasets like the Stanford Human Preferences (SHP) [Ethayarajh et al., 2022] and Helpful & Harmless (HH) Datasets [Bai et al., 2022]. The SHP Dataset comprises 385,000 human preferences across 18 subjects, from cooking to legal advice. The HH Dataset consists of two parts: one where crowdworkers interact with AI models for helpful responses, and another where they elicit harmful responses, selecting the more impactful response in each case. Despite the growing interest in this field, there is a notable absence of dedicated benchmarks for continual alignment, presenting an opportunity for future research and development in this area.
Sequence Continual learning for large language models involves evaluating the model’s performance over a task sequence. Performance can be measured by three typical continual learning metrics: (1) average performance; (2) Forward Transfer Rate (FWT), and (3) Backward Transfer Rate (BWT) [Lopez-Paz and Ranzato, 2017; Wu et al., 2022]:
(1) FWT assesses the impact of knowledge acquired from previous tasks on the initial ability to perform a new task, prior to any dedicated training for that new task.
(2) BWT measures catastrophic forgetting by comparing a model’s performance on old tasks before and after learning new ones.
(3) Average Performance, e.g., the average accuracy assesses the ability of a model or algorithm to effectively learn from and adapt to a sequence of data streams or tasks over time.
where At,i is the accuracy of models on the test set of ith task after model learning on the tth task and ˜bi is the test accuracy for task i at random initialization.
Large language models continually trained on different stages can experience the issue of unconscious forgetting [Lin et al., 2023], which shows that continual instruction tuning can erode the LLM’s general knowledge. Additionally, previous studies [Qi et al., 2023] also demonstrate that the behavior of safely aligned LLMs can be easily affected and degraded by instruction tuning. To quantify these limitations, TRACE [Wang et al., 2023b] proposes to evaluate LLMs by using three novel metrics: General Ability Delta (GAD), Instruction Following Delta (IFD), and Safety Delta (SD):
(1) GAD assesses the performance difference of an LLM on general tasks after training on sequential target tasks.
(2) IFD assesses the changes of model’s instruction=following ability after training on sequential different tasks.
(3) SD assesses the safety variation of a model’s response after sequential training.
The baseline performance of the initial LLM on the i-th task is represented by R0,i. After incrementally learning up to the t-th task, the score on the i-th task becomes Rt,i. And RG, RI , and RS represent the performance of LLM on general tasks (assessing the information obtained from pre-training), instruction-following tasks, and alignment tasks, respectively. These measure changes in an LLM’s overall capabilities, adherence to instructions, and safety after continual learning, going beyond traditional benchmarks by focusing on maintaining inherent skills and aligning with human preferences.
Computation-efficient Continual Learning In the realm of computation efficiency, the focus is on enhancing the continual pretraining process with minimized computational resources [Verwimp et al., 2023]. This involves developing innovative architectures that can handle the increasing complexity of pretraining tasks without proportional increases in computational demands. Efficiency in algorithms and data structures becomes crucial, especially in managing the extensive data involved in pretraining. Additionally, energyefficient learning models are vital for sustainable scaling of LLMs, aligning with Green AI initiatives. This area requires balancing the computational cost vs the benefits in terms of model performance and capabilities.
Social Good Continual Learning Social responsibility in continual learning encompasses ensuring privacy and data security, particularly in the context of continual instruction tuning [Gabriel, 2020]. As LLMs are fine-tuned with more specific instructions or tasks, the handling of sensitive or personal data must be managed securely and ethically. Aligning with human values and culture is also paramount, especially in the realm of continual preference learning. This involves incorporating ethical AI principles and cultural sensitivities to ensure that the model’s outputs are aligned with societal norms and values.
Automatic Continual Learning A significant challenge lies in creating systems capable of autonomously overseeing their learning processes, seamlessly adjusting to novel tasks (instruction tuning) and user preferences (alignment) while relying solely on the inherent capabilities of LLMs, all without the need for manual intervention [Qiao et al., 2024]. Automatic continual learning includes multi-agent systems capable of collaborative learning and self-planning algorithms that can autonomously adjust learning strategies based on performance feedback. Such systems would represent a significant advancement in the autonomy of LLMs. Continual Learning with Controllable Forgetting Controllable forgetting is particularly relevant to continual pretraining. The ability to selectively retain or forget information as the model is exposed to new data streams can prevent catastrophic forgetting [Qi et al., 2023] and enhance model adaptability [Wang et al., 2023b]. This challenge also extends to managing misinformation and unlearning incorrect or outdated information [Chen and Yang, 2023], ensuring the accuracy and reliability of the LLM over time. Continual Learning with History Tracking Effective history tracking is vital for understanding the evolution of the LLM through its phases of pre-training, instruction tuning, and preference learning. Managing history in model parameters and using external memory architectures can help in tracking the influence of past learning on current model behavior and decisions [Mialon et al., 2023]. This is crucial for analyzing the effectiveness of continual learning processes and making informed adjustments. Theoretical insights on LLM in Continual Learning. Numerous evaluation studies have examined the issue of cross-stage forgetting [Lin et al., 2023] and demonstrated the weak robustness of aligned LLMs [Qi et al., 2023]. However, theoretical analyses of how multi-stage training impacts the performance of large language models in subsequent continual learning tasks are scarce. This gap highlights the need for a deeper understanding of the specific changes multi-stage training introduces to LLMs’ learning capabilities and longterm performance.
Continual learning holds the vital importance of allowing large language models to be regularly and efficiently updated to remain up-to-date with the constantly changing human knowledge, language and values. We showcase the complex, multi-stage process of continual learning in LLMs, encompassing continual pretraining, instruction tuning, and alignment, a paradigm more intricate than those used in continual learning on smaller models. As the first survey of its kind to thoroughly explore continual learning in LLMs, this paper categorizes the updates by learning stages and information types, providing a detailed understanding of how to effectively implement continual learning in LLMs. With a discussion of major challenges and future work directions, our goal is to provide a comprehensive account of recent developments in continual learning for LLMs, shedding light on the development of more advanced and adaptable language models.