banner
IWSR

IWSR

我永远喜欢志喜屋梦子!

Homogeneous coordinates and matrix derivation of basic geometric transformations in two-dimensional graphics.

Homogeneous coordinates

"Homogeneous coordinates are one of the important tools in computer graphics, which can be used to distinguish between vectors and points, and are also easier to use for affine (linear) geometric transformations." - F.S. Hill, JR.

As mentioned in the quote, the biggest advantage of homogeneous coordinates is that they can differentiate between the description of "coordinates" and "vectors".

Simply put, in a regular Cartesian coordinate system (or Cartesian coordinate system), (xA, yA) can represent point A, or it can be used to represent vector oA\vec{oA}. This ambiguous representation is not conducive to accurate abstract description for computers.

Homogeneous coordinates solve this problem by elevating n dimensions to n+1 dimensions.

We can add an additional variable w to the end of a 2D Cartesian coordinate to form a 2D homogeneous coordinate. Therefore, a point (X, Y) becomes (x, y, w) in homogeneous coordinates, and we have

X = x/w

Y = y/w

In homogeneous coordinates,

  • Describing a point A is represented as (xA, yA, 1)
  • Describing a vector oA\vec{oA} is represented as (xA, yA, 0)

By substituting w=1,0 into x/w, we can understand why 1 represents a point (position) and 0 represents a vector (direction).

In addition, it is also convenient for operations such as vector addition.

image

Of course, in addition to describing vectors and points, the introduction of homogeneous coordinates also facilitates the description of geometric transformations (linear transformations).

For example, if we don't use homogeneous coordinates to represent 2D translation, it would look like this:

image

Basic Geometric Transformations in 2D Graphics#

2D graphic transformations can be roughly divided into the following five categories: translation, scaling, rotation, reflection, and shearing.

1. Translation#

Describes the transformation from point (x, y) to (x + dx, y + dy).

By introducing homogeneous coordinates, it can be expressed as (x, y, 1) transformed to (x + dx, y + dy, 1).

At this point, linear transformation can be used as a tool to describe the transformation process. After introducing the transformation matrix, the problem becomes solving the transformation matrix.

Given:

(abcdefghi)(xy1)=(x+dxy+dy1)\begin{pmatrix} a & b & c \\ d & e & f \\ g & h & i \end{pmatrix} \begin{pmatrix} x \\ y \\ 1 \end{pmatrix} = \begin{pmatrix} x + dx \\ y + dy \\ 1 \end{pmatrix}

We have:

{ax+by+c=x+dxdx+ey+f=y+dygx+hy+i=1\begin{cases} ax + by + c = x + dx \\ dx + ey + f = y + dy \\ gx + hy + i = 1 \end{cases}

The solution for the transformation matrix is:

(10dx01dy001)\begin{pmatrix} 1 & 0 & dx \\ 0 & 1 & dy \\ 0 & 0 & 1 \end{pmatrix}

Thus, we can use this transformation matrix to describe the translation process in mathematical terms.

2. Scaling#

Describes the transformation from point (x, y) to (sx * x, sy * y), where sx and sy are constants.

By introducing homogeneous coordinates, it can be expressed as (x, y, 1) transformed to (sx * x, sy * y, 1).

By introducing the transformation matrix:

Given:

(abcdefghi)(xy1)=(sxxsyy1)\begin{pmatrix} a & b & c \\ d & e & f \\ g & h & i \end{pmatrix} \begin{pmatrix} x \\ y \\ 1 \end{pmatrix} = \begin{pmatrix} sx * x \\ sy * y \\ 1 \end{pmatrix}

We have:

{ax+by+c=sxxdx+ey+f=syygx+hy+i=1\begin{cases} ax + by + c = sx * x \\ dx + ey + f = sy * y \\ gx + hy + i = 1 \end{cases}

The solution for the transformation matrix is:

(sx000sy0001)\begin{pmatrix} sx & 0 & 0 \\ 0 & sy & 0 \\ 0 & 0 & 1 \end{pmatrix}

3. Rotation#

Explanation of rotation requires the introduction of the unit circle.

image

In the figure, point B is rotated to point C, and the angle between AB and the X-axis is α, and the angle between AC and AB is β.

Then the coordinates of point B can be represented as (cosα, sinα), and the coordinates of point C are (cos(α + β), sin(α + β)).

Expanding the coordinates of point C, we have C = (cosα cosβ - sinα sinβ, sinα cosβ + cosα sinβ).

Let the coordinates of point B be (x, y), then the coordinates of point C are (x cosβ - y sinβ, y cosβ + x sinβ).

By introducing homogeneous coordinates, it can be expressed as (x, y, 1) transformed to (x cosβ - y sinβ, y cosβ + x sinβ, 1).

By introducing the transformation matrix:

Given:

(abcdefghi)(xy1)=(xcosβysinβycosβ+xsinβ1)\begin{pmatrix} a & b & c \\ d & e & f \\ g & h & i \end{pmatrix} \begin{pmatrix} x \\ y \\ 1 \end{pmatrix} = \begin{pmatrix} xcosβ - ysinβ \\ ycosβ + xsinβ \\ 1 \end{pmatrix}

We have:

{ax+by+c=xcosβysinβdx+ey+f=ycosβ+xsinβgx+hy+i=1\begin{cases} ax + by + c = xcosβ - ysinβ \\ dx + ey + f = ycosβ + xsinβ \\ gx + hy + i = 1 \end{cases}

The solution for the transformation matrix is:

(cosβsinβ0sinβcosβ0001)\begin{pmatrix} cosβ & -sinβ & 0 \\ sinβ & cosβ & 0 \\ 0 & 0 & 1 \end{pmatrix}

4. Reflection#

"In mathematics, reflection is a mapping that transforms an object into its mirror image. To reflect a plane figure, a line (reflection axis) is needed, and for reflection in three-dimensional space, a plane is used as the mirror."

If we follow the quote, reflection can be divided into reflection based on the X-axis and reflection based on the Y-axis. However, there is also the concept of central reflection (point reflection).

Reflection based on the X-axis#

Describes the transformation from point (x, y) to (x, -y).

Given:

(abcdefghi)(xy1)=(xy1)\begin{pmatrix} a & b & c \\ d & e & f \\ g & h & i \end{pmatrix} \begin{pmatrix} x \\ y \\ 1 \end{pmatrix} = \begin{pmatrix} x \\ -y \\ 1 \end{pmatrix}

We have:

{ax+by+c=xdx+ey+f=ygx+hy+i=1\begin{cases} ax + by + c = x \\ dx + ey + f = -y \\ gx + hy + i = 1 \end{cases}

The solution for the transformation matrix is:

(100010001)\begin{pmatrix} 1 & 0 & 0 \\ 0 & -1 & 0 \\ 0 & 0 & 1 \end{pmatrix}

Reflection based on the Y-axis#

Describes the transformation from point (x, y) to (-x, y).

Given:

(abcdefghi)(xy1)=(xy1)\begin{pmatrix} a & b & c \\ d & e & f \\ g & h & i \end{pmatrix} \begin{pmatrix} x \\ y \\ 1 \end{pmatrix} = \begin{pmatrix} -x \\ y \\ 1 \end{pmatrix}

We have:

{ax+by+c=xdx+ey+f=ygx+hy+i=1\begin{cases} ax + by + c = -x \\ dx + ey + f = y \\ gx + hy + i = 1 \end{cases}

The solution for the transformation matrix is:

(100010001)\begin{pmatrix} -1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}

Reflection based on the point (p, q)#

Describes the transformation from point (x, y) to (2p-x, 2q-y).

Given:

(abcdefghi)(xy1)=(2px2qy1)\begin{pmatrix} a & b & c \\ d & e & f \\ g & h & i \end{pmatrix} \begin{pmatrix} x \\ y \\ 1 \end{pmatrix} = \begin{pmatrix} 2p-x \\ 2q-y \\ 1 \end{pmatrix}

We have:

{ax+by+c=2pxdx+ey+f=2qygx+hy+i=1\begin{cases} ax + by + c = 2p-x \\ dx + ey + f = 2q-y \\ gx + hy + i = 1 \end{cases}

The solution for the transformation matrix is:

(102p012q001)\begin{pmatrix} -1 & 0 & 2p \\ 0 & -1 & 2q \\ 0 & 0 & 1 \end{pmatrix}

5. Shearing#

Definition is shown in the figure, it is like the distortion of a shape in a certain direction. The range of α and β is [0, 90°).

image

Shearing with y-axis as the dependent axis#

Describes the transformation from point (x, y) to (x + y * tanα, y).

Given:

(abcdefghi)(xy1)=(x+y.tanαy1)\begin{pmatrix} a & b & c \\ d & e & f \\ g & h & i \end{pmatrix} \begin{pmatrix} x \\ y \\ 1 \end{pmatrix} = \begin{pmatrix} x+y.tanα \\ y \\ 1 \end{pmatrix}

We have:

{ax+by+c=x+y.tanαdx+ey+f=ygx+hy+i=1\begin{cases} ax + by + c = x+y.tanα \\ dx + ey + f = y \\ gx + hy + i = 1 \end{cases}

The solution for the transformation matrix is:

(1tanα0010001)\begin{pmatrix} 1 & tanα & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}

Shearing with x-axis as the dependent axis#

Describes the transformation from point (x, y) to (x, y + x * tanβ).

Given:

(abcdefghi)(xy1)=(xy+x.tanβ1)\begin{pmatrix} a & b & c \\ d & e & f \\ g & h & i \end{pmatrix} \begin{pmatrix} x \\ y \\ 1 \end{pmatrix} = \begin{pmatrix} x \\ y + x.tanβ \\ 1 \end{pmatrix}

We have:

{ax+by+c=xdx+ey+f=y+x.tanβgx+hy+i=1\begin{cases} ax + by + c = x \\ dx + ey + f = y + x.tanβ \\ gx + hy + i = 1 \end{cases}

The solution for the transformation matrix is:

(100tanβ10001)\begin{pmatrix} 1 & 0 & 0 \\ tanβ & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}
Loading...
Ownership of this post data is guaranteed by blockchain and smart contracts to the creator alone.