V being collinear is obvious, the question is/was also which additional orthogonal projections such as camera position for vision would improve the transformer.