Fine-tune for a Domain Text Generation

Hi, I want to fine-tune BioGPT with Hugging Face Transformers for Q&A in a specific domain. I am loading the model with BioGptForCausalLM.from_pretrained("microsoft/biogpt").

When I train the model with Trainer.train(), I get this error: "The model did not return a loss from the inputs, only the following keys: logits,past_key_values. For reference, the inputs it received are input_ids,attention_mask."

Which checkpoint or model class should I load so that training returns a loss?
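
For context on the error: it lists only input_ids and attention_mask as inputs, which suggests the batches contain no labels field, and a causal LM head in Transformers computes a loss only when labels are passed. Below is a minimal sketch of adding labels to tokenized batches, assuming the data is already tokenized and padded. The function name add_causal_lm_labels and the token ids are hypothetical, for illustration only.

```python
IGNORE_INDEX = -100  # label value that PyTorch cross-entropy ignores by default


def add_causal_lm_labels(batch):
    """Return a copy of the batch with a labels field for causal LM training.

    Labels are a copy of input_ids, with padded positions (attention_mask == 0)
    replaced by IGNORE_INDEX so they do not contribute to the loss.
    The one-token shift between inputs and targets is done inside the model.
    """
    out = dict(batch)
    out["labels"] = [
        [tok if mask == 1 else IGNORE_INDEX for tok, mask in zip(ids, attn)]
        for ids, attn in zip(batch["input_ids"], batch["attention_mask"])
    ]
    return out


# Hypothetical tokenized batch: one sequence of three tokens plus two pad tokens.
example = {
    "input_ids": [[5, 8, 13, 1, 1]],
    "attention_mask": [[1, 1, 1, 0, 0]],
}
with_labels = add_causal_lm_labels(example)
print(with_labels["labels"])  # [[5, 8, 13, -100, -100]]
```

If the data lives in a Hugging Face datasets.Dataset (an assumption here), a function like this could be applied with dataset.map(add_causal_lm_labels, batched=True). An alternative that needs no custom function is to pass data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False) to Trainer, which builds labels from input_ids in a similar way. With either approach the same checkpoint and BioGptForCausalLM class should be usable as-is.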
