
Fine-tune DeBERTa v3 language model, worthwhile endeavour? #151

@shensmobile

Hey everyone, I've been using RoBERTa for the past year or so but have been looking into DeBERTa as well. My typical workflow with RoBERTa is to fine-tune the MLM on ~3 million medical reports to domain-adapt it before training on downstream tasks. I've found that this greatly improves the performance of the downstream models.
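
For context, here's roughly what my MLM domain-adaptation step looks like (a simplified sketch using Hugging Face Transformers; the file path and hyperparameters below are placeholders, not my actual setup):

```python
# Simplified sketch of MLM domain adaptation with Hugging Face Transformers.
# "medical_reports.txt" and all hyperparameters are placeholders.
from datasets import load_dataset
from transformers import (
    AutoModelForMaskedLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

tokenizer = AutoTokenizer.from_pretrained("roberta-base")
model = AutoModelForMaskedLM.from_pretrained("roberta-base")

# Assumes the reports are stored as plain text, one report per line.
dataset = load_dataset("text", data_files={"train": "medical_reports.txt"})

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=512)

tokenized = dataset.map(tokenize, batched=True, remove_columns=["text"])

# Dynamic masking: 15% of tokens are masked on the fly each pass.
collator = DataCollatorForLanguageModeling(tokenizer=tokenizer, mlm_probability=0.15)

args = TrainingArguments(
    output_dir="roberta-medical-mlm",
    per_device_train_batch_size=16,
    num_train_epochs=1,
    learning_rate=5e-5,
)

trainer = Trainer(
    model=model,
    args=args,
    train_dataset=tokenized["train"],
    data_collator=collator,
)
trainer.train()
model.save_pretrained("roberta-medical-mlm")
tokenizer.save_pretrained("roberta-medical-mlm")
```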

With DeBERTa, I presume that I can't reuse my existing MLM fine-tuning code, since DeBERTa v3 isn't pre-trained with MLM; it uses RTD (replaced token detection). The pre-training scripts here seem to be for training a model from scratch, which I don't think I have enough data or compute power/time to do efficiently.
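
As I understand it, RTD trains the model as a discriminator that flags tokens a small generator has swapped out, so the objective is token-level binary classification rather than vocabulary-sized prediction. A minimal sketch of just the loss, for intuition only (the real DeBERTa v3 setup also involves the generator and gradient-disentangled embedding sharing):

```python
# ELECTRA-style RTD loss sketch; shapes and tensors below are toy examples.
import torch
import torch.nn.functional as F

def rtd_loss(disc_logits: torch.Tensor, replaced: torch.Tensor) -> torch.Tensor:
    # disc_logits: (batch, seq_len) score that each token was replaced
    # replaced:    (batch, seq_len) 1.0 where the generator swapped the token
    return F.binary_cross_entropy_with_logits(disc_logits, replaced)

logits = torch.randn(2, 8)                     # fake discriminator outputs
labels = torch.randint(0, 2, (2, 8)).float()   # fake replacement mask
print(rtd_loss(logits, labels))
```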

I presume that if I wanted to continue pre-training the RTD language model on my data, I would use the "deberta-v3-X-continue" option in rtd.sh? If so, do you guys think this would be worth my time? Or should I just fine-tune my downstream tasks on the supplied pre-trained models?
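
For the downstream-only alternative, I'd be starting from the released checkpoints directly, something like this (the checkpoint name and label count are just an example, not my actual task):

```python
# Downstream-only alternative: fine-tune a released DeBERTa v3 checkpoint
# directly. "microsoft/deberta-v3-base" and num_labels=2 are example choices.
from transformers import AutoModelForSequenceClassification, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("microsoft/deberta-v3-base")
model = AutoModelForSequenceClassification.from_pretrained(
    "microsoft/deberta-v3-base", num_labels=2
)
```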
