Skip to content

Commit aff0de1

Browse files
committed
fix run_qwen3_vl_8b_dflash_online.sh
1 parent c19b963 commit aff0de1

1 file changed

Lines changed: 3 additions & 3 deletions

File tree

examples/run_qwen3_vl_8b_dflash_online.sh

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -21,13 +21,13 @@ torchrun \
2121
--train-data-path $ROOT_DIR/cache/dataset/allava4v-mix-20k_train.localimg_regen.jsonl \
2222
--build-dataset-num-proc $BUILD_DATASET_NUM_PROC \
2323
--min-pixels 50176 \
24-
--max-pixels 1003520 \
24+
--max-pixels 802816 \
2525
--output-dir $ROOT_DIR/outputs/qwen3-vl-8b-allava4v20k-dflash \
2626
--cache-dir $ROOT_DIR/cache \
2727
--num-epochs 6 \
2828
--batch-size 2 \
29-
--learning-rate 6e-4 \
30-
--warmup-ratio 0.04 \
29+
--learning-rate 1e-4 \
30+
--warmup-ratio 0.08 \
3131
--max-grad-norm 1.0 \
3232
--max-length 4096 \
3333
--num-draft-layers 5 \

0 commit comments

Comments
 (0)