About some statements in KNOWLEDGE FUSION OF LARGE LANGUAGE MODELS 

Dear authors, I have some problems while reading the paper KNOWLEDGE FUSION OF LARGE LANGUAGE MODELS, I noticed there are two formats discribed the losses designed for training.  I suppose the loss function should be minimized while the discrepancy function D(·) is minimized, but these formats show an opposite result?
  
![image](https://github.com/user-attachments/assets/a0788346-f676-4a1e-b025-73e57b711afe)
![image](https://github.com/user-attachments/assets/e2de1582-f886-4cd1-8587-bee4167b6263)


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

About some statements in KNOWLEDGE FUSION OF LARGE LANGUAGE MODELS #24

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

About some statements in KNOWLEDGE FUSION OF LARGE LANGUAGE MODELS #24

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions