The prediction made by the head is not a single task, but multiple tasks simultaneously with a shared layer.