Fashion-Reader/Visual-Question-Answering

Visual Question Answering

  • Develop a Visual Question Answering model that looks at an image and answers a given question about it

Members

| 권태양 | 류재희 | 박종헌 | 신찬엽 | 조원 |

Set up

1. Install Requirements

$ pip install -r requirements.txt

2. Train

$ python3 train_v1.py # version 1
$ python3 train_v2.py # version 2
$ python3 train_v3.py # version 3

3. Inference

$ python3 inference_v1.py # version 1
$ python3 inference_v2.py # version 2
$ python3 inference_v3.py # version 3
$ python3 ensemble.py # ensemble
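
The README does not say how ensemble.py combines the three versions. As a rough illustration only, one common approach is a per-question majority vote over the answers each version predicts. The sketch below assumes each run writes an answers.csv with an answer column (the column name "answer" and both function names are hypothetical, not taken from the repository).

```python
import csv
from collections import Counter
from pathlib import Path


def read_answers(csv_path, column="answer"):
    """Read one column of predicted answers from a CSV file.

    The column name "answer" is an assumption; adjust it to the real header.
    """
    with Path(csv_path).open(newline="", encoding="utf-8") as f:
        return [row[column] for row in csv.DictReader(f)]


def majority_vote(predictions):
    """Combine per-model answer lists by majority vote, one question at a time.

    Ties go to the model listed first, so order the inputs by how much
    you trust each model.
    """
    if len({len(p) for p in predictions}) != 1:
        raise ValueError("all models must answer the same number of questions")
    combined = []
    for answers in zip(*predictions):
        counts = Counter(answers)
        best = max(counts.values())
        # The first answer (in model order) that reaches the top count wins.
        combined.append(next(a for a in answers if counts[a] == best))
    return combined


if __name__ == "__main__":
    # Toy predictions standing in for the three versions' outputs.
    v1 = ["red", "dress", "yes"]
    v2 = ["red", "skirt", "no"]
    v3 = ["blue", "skirt", "maybe"]
    print(majority_vote([v1, v2, v3]))
```

With real runs, the lists would come from something like read_answers("results/train_v1/answers.csv"), again assuming that header. If the models expose class probabilities, averaging those is a possible alternative to voting on final answers.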

Code File

$ fashion_reader
├── config
│   └── train_config_base.yaml
├── models
│   ├── get_model.py
│   └── vqa_model.py
├── modulus
│   ├── dataset.py
│   ├── earlystoppers.py
│   ├── recorders.py
│   ├── trainer.py
│   └── utils.py
├── results
├── train_v1.py
├── inference_v1.py
├── train_v2.py
├── inference_v2.py
├── train_v3.py
├── inference_v3.py
└── ensemble.py

Output

$ fashion_reader
└── results
    ├── train_v1
    │     ├── loss.png
    │     ├── model.pt
    │     ├── answers.csv
    │     ├── score.jpg
    │     ├── train_config_base.yaml
    │     └── train_log.log
    ├── train_v2
    │     └── ...
    └── train_v3
          └── ...
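
To confirm that a run produced everything listed in the tree, a small check along these lines may help. It is a minimal sketch: the file names come from the train_v1 entry, while the helper name missing_outputs is hypothetical. It also assumes train_v2 and train_v3 contain the same files, which the tree elides.

```python
from pathlib import Path

# File names taken from the train_v1 entry of the results tree.
EXPECTED_FILES = (
    "loss.png",
    "model.pt",
    "answers.csv",
    "score.jpg",
    "train_config_base.yaml",
    "train_log.log",
)


def missing_outputs(run_dir, expected=EXPECTED_FILES):
    """Return the expected file names that are absent from run_dir."""
    run_dir = Path(run_dir)
    return [name for name in expected if not (run_dir / name).is_file()]


if __name__ == "__main__":
    # Assumption: v2 and v3 write the same set of files as v1.
    for version in ("train_v1", "train_v2", "train_v3"):
        missing = missing_outputs(Path("results") / version)
        status = "complete" if not missing else f"missing: {missing}"
        print(version, status)
```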

Description

| Version | Pre-trained Model | Config |
| --- | --- | --- |
| V1 | xlm-roberta-base & resnet50 | Link |
| V2 | xlm-roberta-large & resnet50 | Link |
| V3 | xlm-roberta-base & resnet152 | Link |
