Intrinsic Camera Parameters
I am trying to understand intrinsic camera parameters.
Specifically I not able to understand the skew factor and pixel scaling derivation in it.
Example:
Most of the literature explanation starts like this (x, y, z) - point in real world (x', y') - point in image Derivation Step 1: x' = xf/z y' = yf/z
Step 2: Account for origin shift of the coordinate system (Image plane) (Ox, Oy)
- x' = xf/z + Ox
- y' = yf/z + Oy
Step 3:
Account in skew, here I really don't understand it Consider a point in the image and its coordinates
According to oblique coordinate system - u', v' According to Cartesian coordinate system - u, v
Angle between basis of oblique coordinates - l
relationship
- u' = u - v*cot(l)
- v' = v/sin(l)
What I don't understand is they replace x and y with u' and v'. Instead I feel it should be other way around, x and y with u and v.
After that step, pixel scaling is done.
In that too most textbook misses one condition when they consider height and width of pixel they don't mention the cases. For example in case of skew is the height along the basis of oblique coordinates or along the Cartesian coordinates. This is also confusing?
Two summarize I have two question:
Q-1 How coordinate system is transformed when there is skew?
Q-2 How pixel scaling is done?