Continual Pre-training of Language Models (ICLR 2023)
novel proxy for initialization
soft-masking
contrastive learning