How to evaluate a pretrained model?

@huminghao16 
Could you include the scripts for evaluating a pretrained model? 
(for example evaluating the large model you have included in the readme.)

I am running this command: 
```
export DATA_DIR=data/drop
export BERT_TRAINED_DIR=out/drop_mtmsn_model
python -m bert.run_mtmsn \
  --vocab_file $BERT_TRAINED_DIR/vocab.txt \
  --bert_config_file $BERT_TRAINED_DIR/bert_config.json \
  --init_checkpoint $BERT_TRAINED_DIR/checkpoint.pth.tar \
  --do_predict \
  --do_lower_case \
  --predict_file $DATA_DIR/drop_dataset_dev.json \
  --predict_batch_size 48 \
  --max_seq_length 512 \
  --span_extraction \
  --addition_subtraction \
  --counting \
  --negation \
  --gradient_accumulation_steps 2 \
  --optimize_on_cpu \
  --output_dir out/drop_mtmsn_model_dev_file
```
where:
```
$ ls out/drop_mtmsn_model
bert_config.json  checkpoint.pth.tar  network.txt  parameter.txt  performance.txt  predictions.json  vocab.txt
```
The output predictions that I see are all random (e.g., many negative numbers). 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to evaluate a pretrained model? #11

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

How to evaluate a pretrained model? #11

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions