HO-3Dv3: Improving the Accuracy of Hand-Object Annotations of the HO-3D Dataset
Abstract
HO-3D is a dataset providing image sequences of various hand-object interaction scenarios annotated with the 3D pose of the hand and the object and was originally introduced as HO-3Dv2. The annotations were obtained automatically using an optimization method, 'HOnnotate', introduced in the original paper. HO-3Dv3 provides more accurate annotations for both the hand and object poses thus resulting in better estimates of contact regions between the hand and the object. In this report, we elaborate on the improvements to the HOnnotate method and provide evaluations to compare the accuracy of HO-3Dv2 and HO-3Dv3. HO-3Dv3 results in 4mm higher accuracy compared to HO-3Dv2 for hand poses while exhibiting higher contact regions with the object surface.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.