Hi, I tried running Boxer on my own RGB-D dataset captured from a fixed desktop camera.
The dataset is organized in the static_rgbd format:
root/
cam_K.txt
rgb/
depth/
I used the prompt apple. The 2D detection looks correct and detects the apple, but the predicted 3D box does not stay aligned with the apple. It appears to remain near the initial position or drift away from the object.
My command was:
bash
python run_boxer.py \
--input /path/to/apple \
--labels=apple \
--track \
--track_mode dynamic \
--thresh2d 0.15 \
--thresh3d 0.3
I would like to ask:
Is Boxer expected to work well on fixed-camera desktop RGB-D scenes with small objects such as apples?
Is the model mainly intended for room-scale indoor scenes / Aria-style data rather than tabletop object tracking?
Are there any recommended settings or data requirements for this type of use case?
Hi, I tried running Boxer on my own RGB-D dataset captured from a fixed desktop camera.
The dataset is organized in the
static_rgbdformat: