KTO training with datasets in alpaca format

Nice work!

I'm glad to find that LLaMA-Factory supports KTO training. But training with datasets in alpaca format will lead to an error that all datapoints will be described as desired examples. A possible reason might be that `examples["response"][i][0]["content"]` [here](https://github.com/hiyouga/LLaMA-Factory/blob/main/src/llamafactory/data/preprocess.py#L293) will always be true.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

KTO training with datasets in alpaca format #3803

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development