Article

Global Positioning from a Single Image of a Rectangle in Conical Perspective

by Manuel Estrems Amestoy 1 and Óscar de Francisco Ortiz 2,*

1 Mechanics, Materials and Manufacturing Engineering Department, Technical University of Cartagena, 30202 Cartagena, Spain
2 Department of Engineering and Applied Technologies, University Center of Defense, San Javier Air Force Base, MDE-UPCT, 30720 Santiago de la Ribera, Spain
* Author to whom correspondence should be addressed.
Sensors 2019, 19(24), 5432; https://doi.org/10.3390/s19245432
Submission received: 22 November 2019 / Revised: 5 December 2019 / Accepted: 6 December 2019 / Published: 10 December 2019
(This article belongs to the Section State-of-the-Art Sensors Technologies)

Abstract

This article presents a method to obtain the global position of the focus of a camera from an image that includes a rectangle of known position and dimensions in a fixed reference frame. The technique uses basic principles of descriptive geometry as introduced in engineering courses. We first show how to obtain the dihedral projections of a rectangle after three rotations and one translation. Second, we obtain the image of the rotated rectangle in conical perspective, taking the elevation plane as the drawing plane and, as the viewpoint, a specific point in space represented in the dihedral system. Third, we address the inverse perspective transformation, presenting a method to recover the spatial coordinates of the rectangle from its image. Finally, we verify the method experimentally by taking an image of the rectangle with a camera for which the coordinates in the drawing plane (the center of the image) are the only available position information; from this image, the position and orientation of the camera in 3D are obtained.

1. Introduction

Pose determination is the estimation of the position and orientation of a calibrated camera from a set of correspondences between 3D control points and 2D image points [1]. The determination of surface orientation has important applications such as robotics, object recognition, 3D measurement, and tracking of moving objects. Magee [2] was the first to present a procedure for determining the unique position of a robot in three-dimensional space. That method has been continuously improved in areas as diverse as large non-cooperative satellites [3] or Unmanned Aerial Vehicle (UAV) control [4,5]. Different methods for monocular pose estimation have been studied in the past [6,7,8,9]. More recently, marker-based positioning systems such as ArUco, Chilitags, AprilTags, or ARToolKit, among others, have been introduced to estimate quantitative changes in distances and orientations in many technological applications, such as autonomous robots [10,11,12], unmanned vehicles [13,14,15,16], or virtual assistants [17,18,19,20].
The calibration and orientation of a camera from its images has been achieved with good precision through several approaches in the past, such as using a single image with four coplanar control lines [21], three coplanar circles [22], parallelogrammatic grid points [23], or even only three points in the world coordinate system when a multiple-camera system is used [24]. Becker [25] introduced a technique that solves for radial and decentering lens distortion directly from the results of vanishing point estimation, using an iterative method that minimizes vanishing point dispersion and precludes the need for special calibration templates. Single-image reconstruction has been studied in depth by many authors, such as Delage [26], Wilczkowiak et al. [27], Sturm and Maybank [28], or Micusik et al. [29], assuming perpendicularity and parallelism to recover the missing information. Other authors, such as Penna [30], showed that the two-dimensional perspective projection of an arbitrary quadrilateral of known shape and size in three-space contains sufficient information to determine the exact three-dimensional coordinates of its vertices, generalizing known results for rectangles. Duan [1] used the projection of a trapezium for pose estimation and plane measurement in a very simple way. An iterative algorithm was used by Hong and Yang [31] to establish the relationship between camera parameters and the world coordinates of a given 3D calibration point. Further studies based on rectangular structures, as in Haralick [8] or Wefelscheid [32], use similar concepts with a different approach. In contrast, our research uses the information provided by the dihedral projections of a rectangle to determine the image of the rotated rectangle in conical perspective.
Computer vision has been used in areas such as unmanned vehicles to estimate relative 3D position and attitude using algorithms based on four feature points, exploiting square and parallelism relations to avoid complicated calculations [33]. An algorithm for pose estimation based on volume measurements of tetrahedra composed of target points and the lens center of the vision system was proposed by Abidi [6]. 3D model reconstruction from a single image, calibrating a camera and recovering the geometry and photometry of objects, was part of Guillou's research [34], and a novel method to find the initial solutions for iterative camera pose estimation using coplanar points was provided by Zhou [35]. A general photogrammetric method for determining object position and orientation was presented by Yuan [36]. Recently, Wang et al. [37] studied active relocalization of a 3D camera pose from a single reference image, a recent and challenging problem in computer vision and robotics. Pose estimation of smooth metal parts is an important task in intelligent manufacturing; Ulrich [38], Sakcak [39], Han [40], and He [41] proposed solutions using a monocular camera and corresponding practical algorithms.
The adjustment of tools in machining centers is usually the slowest and most critical operation in the positioning of machined parts. A tool that combines machine displacements with edge-detected images can be adjusted at micrometric scales without the need for lasers or probes. Another possible application is vision-based metrology: when the features to be measured lie in the plane of a rectangle of known dimensions, the dihedral projection of those features can be obtained and non-contact metrological checks can be performed immediately (in real time), compensating many of the existing errors. This is an essential aspect for achieving the efficiency and flexibility required by controls in production systems in Industry 4.0.
In this work, a new method to obtain the camera coordinates of a rectangle from its image is proposed. The method is based on the principles of descriptive geometry as developed by Monge [42], which are studied in basic engineering courses. In order to explain the method, the construction of a rectangle in conical perspective is first reviewed, and the inverse path is then proposed. Finally, an experiment was designed to check the precision of the method.

2. Dihedral Projection of a Rectangle: Rotations and Translations

In this case, the input data of the problem are the dihedral projections of a rectangle in which the length L of one side is known; the rectangle is therefore represented by its coordinates x* and z*. The rectangle is rotated by three angles ϕ, ξ, and θ: the corresponding transformation matrices are multiplied to obtain a global rotation matrix, a translation to the point X0 is applied, and the coordinates of the vertices are then obtained and collected in a table of dihedral information. The Top View of the dihedral system is represented by the xy plane, and the elevation (Front View) by the xz plane. The projections of the rectangle on both planes constitute its dihedral representation [42,43].
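As an illustration of this construction, the following sketch (in Python with NumPy) computes the rotated and translated vertices and their dihedral projections; the assignment of ϕ, ξ, θ to the z, x, and y axes and their order of application is our assumption, since this section does not fix the convention:

```python
import numpy as np

def rectangle_vertices(length, width):
    """Vertices of a length x width rectangle centered at the origin, z = 0."""
    return np.array([[-length / 2, -width / 2, 0.0],
                     [ length / 2, -width / 2, 0.0],
                     [ length / 2,  width / 2, 0.0],
                     [-length / 2,  width / 2, 0.0]])

def global_rotation(phi, xi, theta):
    """Global rotation matrix as a product of three elementary rotations
    (axis order z, x, y is an assumption of this sketch)."""
    cf, sf = np.cos(phi), np.sin(phi)
    cx, sx = np.cos(xi), np.sin(xi)
    ct, st = np.cos(theta), np.sin(theta)
    Rz = np.array([[cf, -sf, 0], [sf, cf, 0], [0, 0, 1]])
    Rx = np.array([[1, 0, 0], [0, cx, -sx], [0, sx, cx]])
    Ry = np.array([[ct, 0, st], [0, 1, 0], [-st, 0, ct]])
    return Ry @ Rx @ Rz

# Rotate an A4-sized rectangle (in meters) and translate it to X0.
X0 = np.array([0.10, 0.50, 0.05])
P = rectangle_vertices(0.297, 0.210) @ global_rotation(0.3, 0.2, 0.1).T + X0

top_view = P[:, [0, 1]]    # Top View: projection on the xy plane
front_view = P[:, [0, 2]]  # Front View (elevation): projection on the xz plane
```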

3. Conical Projection

With the viewpoint at coordinates (V_x, V_y, 0), represented in the same dihedral system as the rectangle, where the Front View coincides with the image plane and V_y is the focal distance, the coordinates (x*, z*) of the vertices of the rectangle in conical perspective are obtained. The method consists in drawing, in the Top View, the line that passes through (V_x, V_y, 0) and the Top projection (P_x, P_y, 0) of a point P; its intersection with the image plane gives the coordinate x_p*. The coordinate z_p* is then calculated by drawing the line that passes through the Front View of the viewpoint and the Front projection of P, and finding its intersection with the vertical line through x_p*. Consequently, the conical projection of the point on the image plane, with coordinates (x_p*, z_p*), is obtained. When this operation is performed for the four vertices of the rectangle (Figure 1), the rectangle in conical perspective is obtained.
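Analytically, the two graphical intersections amount to intersecting the ray from the viewpoint with the image plane. A minimal sketch, assuming the image plane is y = 0 and the viewpoint V = (V_x, V_y, 0) as above (the sample vertex coordinates are arbitrary):

```python
import numpy as np

def conical_projection(P, Vx, Vy):
    """Conical projection of a 3D point P onto the image plane y = 0, seen
    from the viewpoint V = (Vx, Vy, 0). The ray V + t*(P - V) meets y = 0
    at t = Vy / (Vy - Py), which condenses the two graphical steps
    (x* from the Top View, z* from the Front View) into one formula."""
    Px, Py, Pz = P
    t = Vy / (Vy - Py)
    return Vx + t * (Px - Vx), t * Pz   # (x*, z*)

# Image of the four vertices: the rectangle in conical perspective.
vertices = [(0.0, 0.50, 0.0), (0.297, 0.50, 0.0),
            (0.297, 0.55, 0.21), (0.0, 0.55, 0.21)]
image = [conical_projection(P, Vx=0.05, Vy=-0.0046) for P in vertices]
```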

4. Obtaining the Possible Front and Top Views of the Dihedral Projection of the Rectangle

Using the coordinates in the conical perspective, and knowing the projections of the viewpoint on the Front and Top planes, the coordinates of the edges of the rectangle are calculated (Figure 2).
To do this, we use part of the geometric method described by Wefelscheid et al. [32], obtaining auxiliary points that help us calculate the dihedral projection from the conical projection. These auxiliary points are:
  • Point M*: intersection of the lines joining P1*P3* and P2*P4*.
  • Vanishing point V1*: intersection of the lines P1*P2* and P3*P4*.
  • Vanishing point V2*: intersection of the lines P1*P4* and P2*P3*.
  • Midpoints of the edges P12*, P23*, P34*, P14*: intersections of the lines drawn from the vanishing points through M* with the respective edges.
The auxiliary points are represented in Figure 3.
These operations, which can be done graphically by drawing on paper, reduce computationally to intersections between lines, each defined by two points, as listed in Table 1 and represented in Figure 3.
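As a sketch of this computational core, the auxiliary points of Table 1 can be obtained with a single line-line intersection routine; homogeneous coordinates are one convenient way to write it (the sample image coordinates below are arbitrary):

```python
import numpy as np

def intersect(A, B, C, D):
    """Intersection of the line through A, B with the line through C, D.
    In homogeneous coordinates a line through two points is their cross
    product, and the intersection of two lines is again a cross product."""
    h = lambda p: np.array([p[0], p[1], 1.0])
    x = np.cross(np.cross(h(A), h(B)), np.cross(h(C), h(D)))
    return x[:2] / x[2]   # x[2] -> 0 when the lines are (almost) parallel

# Image vertices P1*..P4* (arbitrary sample values) and the points of Table 1.
P1, P2, P3, P4 = map(np.array, [(0.0, 0.0), (4.0, 0.5), (3.6, 2.5), (0.2, 2.2)])
M = intersect(P1, P3, P2, P4)      # intersection of the diagonals
V1 = intersect(P1, P2, P3, P4)     # vanishing point of sides P1P2, P3P4
V2 = intersect(P1, P4, P2, P3)     # vanishing point of sides P1P4, P2P3
P12 = intersect(P1, P2, M, V2)     # image of the midpoint of side P1P2
P23 = intersect(P2, P3, M, V1)     # image of the midpoint of side P2P3
```

The homogeneous component x[2] also signals the ill-conditioned case of nearly parallel lines, which reappears in the error discussion of Section 7.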
After obtaining these points, a proposal for the Front View of the rectangle is built on the following graphic properties:
  • The points of the Front View projection lie on the lines that start from the center point V, whose coordinates are (V_x, 0, V_z), and go through the image points P1*, P2*, P3*, P4*, M*, P12*, P23*, P34*, P14*.
  • In the dihedral projection, the midpoints lie at the geometric center of the corresponding side, dividing it in two. For example, P12 is the midpoint of the segment that joins P1 and P2.
  • Opposite sides are parallel in the dihedral projection.
By taking advantage of these properties and a trigonometric relation, a first proposal of the rectangle in Front View can be obtained by the following procedure:
  • In the triangle P1*P2*V*, which is divided by the segment V*P12*, a line through P12 is sought whose intersections with the lines V*P1* and V*P2* are equidistant from P12; in this way a possible point P12 in the Front View can be obtained, as shown in Figure 4.
  • An arbitrary distance d is taken to obtain P12.
  • The normal vector of the line P1P2 in the dihedral projection is found by rotating the vector V*P12* by an angle ω obtained from the trigonometric relation (1) (a numeric sketch of this step is given at the end of this section):
    tan ω = 2 / (cot α + cot β). (1)
    The deduction of this expression is detailed in Appendix A, where α is the angle between V*P2* and V*P12*, and β is the angle between V*P1* and V*P12*, represented in Figure 4 and expressed by Equations (2) and (3):
    α = arccos[ (V*P2* · V*P12*) / (|V*P2*| |V*P12*|) ] (2)
    β = arccos[ (V*P1* · V*P12*) / (|V*P1*| |V*P12*|) ]. (3)
  • Points P1 and P2 are obtained by intersecting the line defined by the point P12 and the direction of V*P12* rotated by the angle ω with the lines V*P1* and V*P2*. Once the orientation is calculated, starting from a point on the line V*P1* and drawing a line that intersects the line V*P2* yields the hypothetical side P1P2, already in the Front View of the dihedral projection.
  • With the presumed points P1 and P2 of the Front View of the dihedral projection, it is possible to calculate, using the central point V in the Top View (which lies at a distance from the drawing plane equal to the focal distance), the Top View projections of the points P1 and P2, as shown in Figure 5.
    To get the Top View of the rectangle from the hypothetical Front projection of the side P1P2, its Top projection is obtained from the Top projection of V. For example, the y coordinate of the point P1 is obtained as the intersection of the line that joins V in the Top View with the image point x_P1* and the vertical line through the coordinate x_P1, as shown in Figure 6 and Equation (4); the y coordinate of the point P2 is obtained in the same way:
    y_P1 = V_y + V_y (x_P1 − V_x) / (V_x − x_P1*). (4)
  • With the Top and Front projections of the points P1 and P2, the length of the segment in pixel units is calculated, and the real scale is obtained by matching the distance corresponding to P1P2 with the known length of the edge of the rectangle.
    With the complete coordinates of the points P1 and P2, the distance between both points is calculated.
    Being a problem of geometric proportionality, the solution is found in a single step by applying Thales' theorem.
  • A similar procedure is applied to the triangle P2*P3*V*, divided by V*P23*. Consequently, the two edge orientations that the projection of the rectangle will have in the dihedral system are calculated.
    Obtaining the rest of the points is then direct, since the orientations in the Top View are known; the points P3 and P4 are found by the same correlative method.
Once the three-dimensional coordinates of the rectangle are found, it is possible to perform any operation related to the positioning and orientation of the camera, or to the calculation of distances and angles. The full method is represented in Figure 7.
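As announced in the procedure above, the following sketch condenses the orientation step, Equations (1) to (3), with the signs of α and β taken from the cross products of Appendix A; the function names are ours, and Vs, P1s, P2s, P12s stand for the image points V*, P1*, P2*, P12*:

```python
import numpy as np

def signed_angle(u, v):
    """Angle between the 2D vectors u and v, as in Eqs. (2)-(3), with its
    sign taken from the 2D cross product as in Eqs. (A1)-(A2)."""
    ang = np.arccos(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))
    return np.copysign(ang, u[0] * v[1] - u[1] * v[0])

def front_view_normal(Vs, P1s, P2s, P12s):
    """Unit vector normal to the side P1P2 in the Front View: the direction
    V*P12* rotated by omega, with tan(omega) = 2 / (cot(alpha) + cot(beta))."""
    alpha = signed_angle(P2s - Vs, P12s - Vs)
    beta = signed_angle(P1s - Vs, P12s - Vs)
    omega = np.arctan(2.0 / (1.0 / np.tan(alpha) + 1.0 / np.tan(beta)))
    c, s = np.cos(omega), np.sin(omega)
    d = (P12s - Vs) / np.linalg.norm(P12s - Vs)
    return np.array([c * d[0] - s * d[1], s * d[0] + c * d[1]])  # rotated d
```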

5. Comments on the Described Method and Comparison with Previous Ones

The new method has several advantages over the methods of Haralick [8] and Wefelscheid [32], which are the most widely used:
  • It works with points and lines, as descriptive geometry does, which makes the calculations much more intuitive and based on simple sequences.
  • The method makes little use of trigonometric functions. The only trigonometric relation used is the tangent of the angle between two vectors, which induces very few floating-point errors. In addition, a rotation is applied to the vectors to redraw the rectangle edges in the dihedral projection.
  • It is a direct method, without iterations or matrix inversions.
  • As it is sequential, checks can be performed at each step to easily determine where an error may have occurred. Once the calculations have been verified, the equations can be combined into single expressions that save computation time. The algebraic operations needed to obtain the points barely exceed one hundred, which equates to less than a millisecond of computer time.
With the results, several verifications can be performed, since the method provides data that can be checked independently, such as:
  • The length of the second edge of the rectangle, since it has not been used in the calculation of the inverse perspective.
  • The spatial lines that join the point V with the vanishing points V1 and V2 in the drawing plane are parallel to the sides of the rectangle, so they are perpendicular to each other; consequently, their scalar product must be zero. This also means that the starting data (the focal length) can actually be determined from the vanishing points [34].
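For instance, the focal length check can be written with the classical orthogonality relation between vanishing points used in [34]; in this sketch, v1, v2, and the principal point c are assumed to be in pixel units:

```python
import numpy as np

def focal_from_vanishing_points(v1, v2, c):
    """The rays that back-project v1 and v2 from the camera center are
    (v1 - c, f) and (v2 - c, f); for orthogonal scene directions their
    dot product vanishes, so (v1 - c).(v2 - c) + f**2 = 0."""
    d = np.dot(np.asarray(v1, float) - c, np.asarray(v2, float) - c)
    if d >= 0:
        raise ValueError("vanishing points incompatible with orthogonal sides")
    return np.sqrt(-d)

# Arbitrary sample values, with the principal point of Section 7.
f = focal_from_vanishing_points((5200.0, 1900.0), (-1400.0, 2100.0),
                                np.array([2186.0, 1991.0]))
```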

6. Positioning of the Camera in the Coordinate System Defined by the Rectangle

Object tracking in space through images, navigation, and the calculation of distances and angles in images can easily be performed from the coordinates of known rectangles that serve as a reference (Figure 8). Hence, such rectangles can be used by tracking systems for the positioning of parts in specific coordinate systems.
In global coordinates (5):
    i = P1P2 / |P1P2|,  j = P4P1 / |P4P1|,  k = (P1P2 × P4P1) / |P1P2 × P4P1| = i × j. (5)
The position of the camera in global coordinates would be given by (6):
    X_v = (x_v − x_M, y_v − y_M, z_v − z_M)^T, (6)
and the check in local coordinates would be obtained according to (7):
    X_v = (MV · i, MV · j, MV · k)^T. (7)
In global coordinates, the orientation of the camera follows the vector j_v (8):
    j_v = (0, 1, 0)^T. (8)
In rectangle coordinates, (9) is obtained, which is the projection of j_v on each of the three axes and coincides with the y components of the three vectors i, j, and k expressed in global coordinates:
    (j_vx, j_vy, j_vz)^T = (i · j_v, j · j_v, k · j_v)^T. (9)
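A compact sketch of Equations (5) to (9), taking M as the intersection of the diagonals (the center of the rectangle):

```python
import numpy as np

def camera_in_rectangle_coords(P1, P2, P3, P4, V):
    """Position of the camera focus V and its viewing axis expressed in the
    coordinate system attached to the rectangle, per Eqs. (5)-(9).
    All inputs are 3D points in global (drawing-plane) coordinates."""
    i = (P2 - P1) / np.linalg.norm(P2 - P1)          # Eq. (5)
    j = (P1 - P4) / np.linalg.norm(P1 - P4)
    k = np.cross(i, j)
    M = (P1 + P3) / 2.0                              # center of the rectangle
    MV = V - M                                       # Eq. (6): position in global coords
    X_v = np.array([MV @ i, MV @ j, MV @ k])         # Eq. (7): position in rectangle coords
    j_v = np.array([0.0, 1.0, 0.0])                  # Eq. (8): camera axis in global coords
    axis = np.array([i @ j_v, j @ j_v, k @ j_v])     # Eq. (9): axis in rectangle coords
    return X_v, axis
```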

7. Experimental Tests

To test the method, all images were taken with a CASIO Exilim EX-ZR200 digital camera with a resolution of 4608 × 3456 pixels (16 Mpixels) and a sensor size of 6.16 × 4.62 mm (1/2.3"). The camera was calibrated by the standard computer-vision method, using images of a chessboard; after calibration, the focal distance is 4.6 mm and the principal point is not at the middle of the image but at coordinates (2186, 1991). To measure the position, a Coordinate Measuring Machine (CMM), model Pioneer DEA 03.10.06, with measuring strokes of 600 × 1000 × 600 mm, was used, as seen in Figure 9. The Maximum Permissible Error of the DEA in the measurements is 2.8 + 4.0L/1000 μm. The software used for the measurements was PC-DMIS.
The procedure followed was as follows:
  • Place a DIN A4 size paper on the granite table.
  • Position the camera on a tripod.
  • Take a photo of the DIN A4 paper remotely, so as not to disturb the captured image.
  • Take six points of the camera housing according to the 3-2-1 method [44].
  • Calculate the position of the camera focus with respect to the center of the A4 sheet from the palpated points.
  • Contrast this position with that obtained by image analysis.
  • Repeat steps 3–6 several times, varying the position and angles of the camera.
The parameters used for the test are summarized in Table 2. The coordinates and distances of the paper center obtained by the CMM and by the image analysis, including the differences between both, are shown in Table 3, where subindex e refers to experimental data measured with the CMM and subindex t to theoretical values calculated by the image-analysis algorithm.
Comments on the results:
  • The differences in measured distances are less than 2%, i.e., less than 2 cm at a distance of 1 m.
  • The errors in the x coordinate are due to the parallelism of the lines with the image plane, but they hardly affect the distance calculation, since the contribution of the x coordinate to the global calculation is small.
As can be seen in Table 3 and Figure 10, the difference between the distance from the camera to the center of the sheet measured by image analysis and by the CMM is less than 2%. This indicates that accuracies very close to the actual distances can be achieved visually. Nevertheless, analyzing each coordinate, a very high error in the x coordinate is observed at some points. These errors occur when the camera faces the sheet with the x axis parallel to the sensor plane: the vanishing point in this direction is then far away, and the intersection of the nearly parallel lines is more imprecise. Since in these cases the camera is located at a small x coordinate, the influence of this error on the global error is reduced. This reveals a limitation: the method works best the closer the vanishing points remain.
Figure 11 shows the images obtained with the camera and used to verify the method presented in this article.

8. Conclusions

A new method has been proposed for rectangle reconstruction using elements of descriptive geometry, as used by Monge in 1847 [42], which are widely known to engineers, since they are taught in the early stages of engineering studies. The method is mainly based on intersections between lines, whose calculation is fast and numerically stable; it therefore minimizes errors and optimizes computation. The proposed process uses very few trigonometric functions of small angles, which are the main source of error in other methods, so very few floating-point errors are introduced. Additionally, the trigonometric functions are mainly used for the rotation of vectors to align the edges in the dihedral projections, which also reduces the errors.
In addition, a procedure was carried out to test the calculations experimentally. The proposed technique was tested on a CMM by locating the camera through palpation using the 3-2-1 method, and the position given by the CMM was compared with that calculated from the image taken by the camera. The proposed method yields maximum errors of 2% in the measured distances. The large errors detected in individual coordinates are due to the parallelism of two sides with the image plane: in this case, the vanishing point is distant in space, and its determination by the intersection of two almost parallel lines has more variability.

Author Contributions

Conceptualization, M.E.A. and Ó.d.F.O.; methodology, M.E.A. and Ó.d.F.O.; software, Ó.d.F.O. and M.E.A.; validation, Ó.d.F.O. and M.E.A.; formal analysis, M.E.A.; writing–original draft preparation, Ó.d.F.O.; writing–review and editing, M.E.A. and Ó.d.F.O.; supervision, M.E.A.; visualization, Ó.d.F.O. and M.E.A.

Funding

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
UAV   Unmanned Aerial Vehicle
CMM   Coordinate Measuring Machine
DIN   Deutsches Institut für Normung (German Institute for Standardization)

Appendix A. Obtaining the Expression of tan ω

In order to identify the signs of α and β, the corresponding vector products are calculated and the signs are taken from (A1) and (A2):
    sin α = (V*P2* × V*P12*) / (|V*P2*| |V*P12*|) (A1)
    sin β = (V*P1* × V*P12*) / (|V*P1*| |V*P12*|). (A2)
The angle ω can be obtained from α and β by applying the law of sines to the two triangles into which the segment V*P12* divides the triangle P1*P2*V*, as in (A3)–(A9):
    L / (2 sin α) = |V*P12*| / sin(π − α − ω) = |V*P12*| / (sin α cos ω + cos α sin ω) (A3)
    L / (2 sin β) = |V*P12*| / sin(ω − β) = |V*P12*| / (sin ω cos β − cos ω sin β) (A4)
    L cos ω / (2 sin α) = |V*P12*| / (sin α + cos α tan ω) (A5)
    L cos ω / (2 sin β) = |V*P12*| / (cos β tan ω − sin β) (A6)
    L cos ω / 2 = |V*P12*| / (1 + cot α tan ω) = |V*P12*| / (cot β tan ω − 1) (A7)
    1 + cot α tan ω = cot β tan ω − 1 (A8)
    2 + cot α tan ω = cot β tan ω. (A9)
Consequently, with the signs of α and β fixed by (A1) and (A2), the angle of rotation ω can be calculated using Equation (A10):
    tan ω = 2 / (cot α + cot β). (A10)

References

  1. Duan, F.; Fuchao, W.; Hu, Z. Pose determination and plane measurement using a trapezium. Pattern Recognit. Lett. 2008, 29, 223–231. [Google Scholar] [CrossRef]
  2. Magee, M.; Aggarwal, J. Determining the position of a robot using a single calibration object. In Proceedings of the IEEE International Conference on Robotics and Automation, Atlanta, GA, USA, 13–15 March 1984; Volume 1, pp. 140–149. [Google Scholar] [CrossRef]
  3. Gao, X.H.; Liang, B.; Pan, L.; Li, Z.H.; Zhang, Y.C. A Monocular Structured Light Vision Method for Pose Determination of Large Non-cooperative Satellites. Int. J. Control. Autom. Syst. 2016, 14, 1535–1549. [Google Scholar] [CrossRef]
  4. Martinez, C.; Mondragon, I.F.; Olivares-Mendez, M.A.; Campoy, P. On-board and Ground Visual Pose Estimation Techniques for UAV Control. J. Intell. Robot. Syst. 2011, 61, 301–320. [Google Scholar] [CrossRef] [Green Version]
  5. Zhang, L.; Zhai, Z.; He, L.; Wen, P.; Niu, W. Infrared-Inertial Navigation for Commercial Aircraft Precision Landing in Low Visibility and GPS-Denied Environments. Sensors 2019, 19, 408. [Google Scholar] [CrossRef] [Green Version]
  6. Abidi, M.A.; Chandra, T. A new efficient and direct solution for pose estimation using quadrangular targets—Algorithm and evaluation. IEEE Trans. Pattern Anal. Mach. Intell. 1995, 17, 534–538. [Google Scholar] [CrossRef] [Green Version]
  7. Haralick, R.M. Using perspective transformations in scene analysis. Comput. Graph. Image Process. 1980, 13, 191–221. [Google Scholar] [CrossRef]
  8. Haralick, R.M. Determining camera parameters from the perspective projection of a rectangle. Pattern Recognit. 1989, 22, 225–230. [Google Scholar] [CrossRef]
  9. Quan, L.; Lan, Z. Linear N-point camera pose determination. IEEE Trans. Pattern Anal. Mach. Intell. 1999, 21, 774–780. [Google Scholar] [CrossRef]
  10. Sim, R.; Little, J. Autonomous vision-based robotic exploration and mapping using hybrid maps and particle filters. Image Vis. Comput. 2009, 27, 167–177. [Google Scholar] [CrossRef]
  11. Valencia-Garcia, R.; Martinez-Béjar, R.; Gasparetto, A. An intelligent framework for simulating robot-assisted surgical operations. Expert Syst. Appl. 2005, 28, 425–433. [Google Scholar] [CrossRef]
  12. Pichler, A.; Akkaladevi, S.; Ikeda, M.; Hofmann, M.; Plasch, M.; Wögerer, C.; Fritz, G. Towards Shared Autonomy for Robotic Tasks in Manufacturing. Procedia Manuf. 2017, 11, 72–82. [Google Scholar] [CrossRef]
  13. González, D.; Pérez, J.; Milanés, V. Parametric-based path generation for automated vehicles at roundabouts. Expert Syst. Appl. 2017, 71, 332–341. [Google Scholar] [CrossRef] [Green Version]
  14. Sanchez-Lopez, J.; Pestana, J.; De La Puente, P.; Campoy, P. A reliable open-source system architecture for the fast designing and prototyping of autonomous multi-UAV systems: Simulation and experimentation. J. Intell. Robot. Syst. 2015, 84, 779–797. [Google Scholar] [CrossRef] [Green Version]
  15. Romero-Ramirez, F.J.; Muñoz-Salinas, R.; Medina-Carnicer, R. Speeded up detection of squared fiducial markers. Image Vis. Comput. 2018, 76, 38–47. [Google Scholar] [CrossRef]
  16. Germanese, D.; Leone, G.R.; Moroni, D.; Pascali, M.A.; Tampucci, M. Long-Term Monitoring of Crack Patterns in Historic Structures Using UAVs and Planar Markers: A Preliminary Study. J. Imaging 2018, 4, 99. [Google Scholar] [CrossRef] [Green Version]
  17. Pflugi, S.; Vasireddy, R.; Lerch, T.; Ecker, T.; Tannast, M.; Boemke, N.; Siebenrock, K.; Zheng, G. Augmented marker tracking for peri-acetabular osteotomy surgery. In Proceedings of the 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Seogwipo, South Korea, 11–15 July 2017; pp. 937–941. [Google Scholar] [CrossRef]
  18. Lima, J.P.; Roberto, R.; Simões, F.; Almeida, M.; Figueiredo, L.; Teixeira, J.M.; Teichrieb, V. Markerless tracking system for augmented reality in the automotive industry. Expert Syst. Appl. 2017, 82, 100–114. [Google Scholar] [CrossRef]
  19. Chen, P.; Peng, Z.; Li, D.; Yang, L. An improved augmented reality system based on AndAR. J. Vis. Commun. Image Represent. 2016, 37, 63–69. [Google Scholar] [CrossRef]
  20. Khattak, S.; Cowan, B.; Chepurna, I.; Hogue, A. A real-time reconstructed 3D environment augmented with virtual objects rendered with correct occlusion. In Proceedings of the 2014 IEEE Games Media Entertainment Toronto, ON, Canada, 22–24 October 2014; pp. 1–8. [Google Scholar]
  21. Shang, Y.; Yu, Q.; Zhang, X. Analytical method for camera calibration from a single image with four coplanar control lines. Appl. Opt. 2004, 43, 5364–5369. [Google Scholar] [CrossRef]
  22. Cai, Y.; Huang, Y. A Robust Linear Camera Calibration Based on Coplanar Circles. In Proceedings of 2013 Chinese Intelligent Automation Conference: Intelligent Information Processing. Chinese Assoc Automat, Intelligent Automat Comm; Yangzhou University: Yangzhou, China, 2013; Volume 256, pp. 521–529. [Google Scholar] [CrossRef]
  23. Takahashi, A.; Ishii, I.; Makino, H.; Nakashizuka, M. A camera calibration method using parallelogrammatic grid points. IEICE Trans. Inf. Syst. 1996, E79D, 1579–1587. [Google Scholar]
  24. Mozerov, M.; Amato, A.; Al Haj, M.; Gonzalez, J. A Simple Method of Multiple Camera Calibration for the Joint Top View Projection. In Computer Recognition Systems 2; Springer: Berlin, Germany, 2007; Volume 45, pp. 164–170. [Google Scholar]
  25. Becker, S.; Bove, V. Semiautomatic 3-D model extraction from uncalibrated 2-D camera views. Visual data exploration and analysis II. In Proceedings of the Society of Photo-Optical Instrumentation Engineers (SPIE), Orlando, FL, USA, 19–21 April 1995; Volume 2410, pp. 447–461. [Google Scholar] [CrossRef] [Green Version]
  26. Delage, E.; Lee, H.; Ng, A.Y. Automatic single-image 3D reconstructions of indoor Manhattan world scenes. Robot. Res. 2007, 28, 305–321. [Google Scholar]
  27. Wilczkowiak, M.; Boyer, E.; Sturm, P. Camera calibration and 3D reconstruction from single images using parallelepipeds. In Proceedings of the 8th IEEE International Conference on Computer Vision, Vancouver, BC, Canada, 7–14 July 2001; IEEE Computer Society: Washington, DC, USA, 2001; Volume 1, pp. 142–148. [Google Scholar]
  28. Sturm, P.; Maybank, S. A Method for Interactive 3D Reconstruction of Piecewise Planar Objects from Single Images. In Proceedings of the 10th British Machine Vision Conference (BMVC ’99), Nottingham, UK, 13–16 September 1999; pp. 265–274. [Google Scholar]
  29. Micusik, B.; Wildenauer, H.; Kosecka, J. Detection and matching of rectilinear structures. In Proceedings of the 2008 IEEE Conference on Computer Vision and Pattern Recognition, Anchorage, AK, USA, 23–28 June 2008; IEEE Computer Society: Washington, DC, USA, 2008; pp. 1–7. [Google Scholar] [CrossRef]
  30. Penna, M. Determining camera parameters from the perspective projection of a quadrilateral. Pattern Recognit. 1991, 24, 533–541. [Google Scholar] [CrossRef]
  31. Hong, Z.Q.; Yang, J.Y. An algorithm for camera calibration using a three-dimensional reference point. Pattern Recognit. 1993, 26, 1655–1660. [Google Scholar] [CrossRef]
  32. Wefelscheid, C.; Wekel, T.; Hellwich, O. Monocular Rectangle Reconstruction Based on Direct Linear Transformation. In Proceedings of the VISAPP 2011: International Conference on Computer Vision Theory and Applications, Vilamoura, Portugal, 5–7 March 2011; Institute for Systems and Technologies of Information, Control and Communication: Setubal, Portugal, 2011; pp. 271–276. [Google Scholar]
  33. Shunliang, P.; Xiaojian, W.; Weiqun, S.; Zishan, S. A faster relative 3D position and attitude algorithm based on special four-point feature. In Signal Analysis, Measurement Theory, Photo-Electronic Technology, and Artificial Intelligence, Pts 1 and 2; Beijing Univ Aeronaut & Astronaut: Beijing, China, 2006; Volume 6357, pp. 1–2. [Google Scholar] [CrossRef]
  34. Guillou, E.; Meneveaux, D.; Maisel, E.; Bouatouch, K. Using vanishing points for camera calibration and coarse 3D reconstruction from a single image. Vis. Comput. 2000, 16, 396–410. [Google Scholar] [CrossRef]
  35. Zhou, K.; Wang, X.J.; Wang, Z.; Wei, H.; Yin, L. Complete Initial Solutions for Iterative Pose Estimation From Planar Objects. IEEE Access 2018, 6, 22257–22266. [Google Scholar] [CrossRef]
  36. Yuan, J. A General Photogrammetric Method for Determining Object Position and Orientation. IEEE Trans. Robot. Autom. 1989, 5, 129–142. [Google Scholar] [CrossRef]
  37. Wang, P.; Xu, G.; Cheng, Y.; Yu, Q. Camera pose estimation from lines: A fast, robust and general method. Mach. Vis. Appl. 2019, 30, 603–614. [Google Scholar] [CrossRef]
  38. Ulrich, M.; Wiedemann, C.; Steger, C. CAD-Based Recognition of 3D Objects in Monocular Images. In Proceedings of the 2009 IEEE International Conference on Robotics and Automation ICRA, Kobe, Japan, 12–17 May 2009; pp. 2090–2097. [Google Scholar]
  39. Sakcak, B.; Bascetta, L.; Ferretti, G. Model based Detection and 3D Localization of Planar Objects for Industrial Setups. In Proceedings of the 13th International Conference on Informatics in Control, Automation and Robotics (ICINCO), Lisbon, Portugal, 29–31 July 2016; Volume 2, pp. 360–367. [Google Scholar] [CrossRef] [Green Version]
  40. Han, P.; Zhao, G. CAD-based 3D objects recognition in monocular images for mobile augmented reality. Comput. Graph. UK 2015, 50, 36–46. [Google Scholar] [CrossRef]
  41. He, Z.; Jiang, Z.; Zhao, X.; Zhang, S.; Wu, C. Sparse Template-Based 6-D Pose Estimation of Metal Parts Using a Monocular Camera. IEEE Trans. Ind. Electron. 2019, 390–401. [Google Scholar] [CrossRef]
  42. Monge, G. Géométrie Descriptive; J. Klostermann Fils: Paris, France, 1847; p. 184. [Google Scholar]
  43. Adler, A.A. The Theory of Engineering Drawing; D. Van Nostrand Company: New York, NY, USA, 1912; p. 362. [Google Scholar]
  44. Estrems, M.; Sánchez, H.; Faura, F. Influence of Fixtures on Dimensional Accuracy in Machining Processes. Int. J. Adv. Manuf. Technol. 2003, 21, 384–390. [Google Scholar] [CrossRef]
Figure 1. Plotting a point (P) in a conical perspective from its dihedral projection. Subindex f refers to the Front View and subindex t refers to the Top View. (a) Point dihedral construction view; (b) Dihedral projections.
Figure 2. Actual conical perspective of the object from a photograph: (a) Initial image; (b) Reference model.
Figure 3. Obtaining auxiliary points from the four vertices of the rectangle.
Figure 4. Line orientation that subdivides the segment of the side in two equal segments.
Figure 5. Obtaining the Top View from the Front View.
Figure 6. Obtaining the Top projection of a point of the conical with a known Front projection.
Figure 7. Summary for obtaining the dihedral from the conical projection (blue: dihedral; black: conical).
Figure 8. Obtaining camera positioning with respect to rectangle coordinate system.
Figure 9. (a) Set-up used for the measurements of the position of the camera in rectangle coordinates through a CMM; (b) Points of the camera touched by the probe for the implementation of the 3-2-1 method [44].
Figure 10. Graphical representation of the theoretical and experimental values of the center points of the rectangles. Distance errors have also been included for each point.
Figure 11. Images analyzed during the experimental test.
Table 1. Obtaining auxiliary points as intersections of lines that go through two points.

Support Point | Line 1  | Line 2
M*            | P1*P3*  | P2*P4*
V1*           | P1*P2*  | P3*P4*
V2*           | P1*P4*  | P2*P3*
P12*          | P1*P2*  | M*V2*
P23*          | P2*P3*  | M*V1*
P34*          | P3*P4*  | M*V2*
P14*          | P1*P4*  | M*V1*
Table 2. Parameters used for the test.

Camera parameters: sensor size 1/2.3" (6.16 × 4.62 mm); width 4608 pixels; height 3456 pixels; focus 3433 pixels (4.6 mm); V_x = 2186 pixels; V_y = 1991 pixels; 1 pixel = 0.0013368 mm.
A4 sheet parameters: width 210 mm; height 297 mm.
Focus position in camera coordinates: x = 30 mm; y = 30 mm; z = 3.5 mm.
Table 3. Coordinates, distances, and errors obtained by CMM and image analysis, where subindex e refers to experimental data measured with the CMM and subindex t to theoretical values calculated by the image-analysis algorithm. Coordinates x, y, z and distances d are expressed in mm; differences of the coordinates and distances (errors) are given in percent.

Image |     x_e |    y_e |    z_e |    d_e |     x_t |    y_t |    z_t |    d_t | ε_x (%) | ε_y (%) | ε_z (%) | ε_d (%)
1     |  201.22 | 222.21 | 367.44 | 474.21 |  210.09 | 200.44 | 365.64 | 466.91 |    4.41 |   −9.79 |   −0.49 |   −1.54
2     | −195.04 | 218.71 | 367.43 | 469.98 | −180.47 | 215.48 | 374.60 | 468.32 |   −7.47 |   −1.48 |    1.95 |   −0.35
3     |   88.39 | 469.44 | 378.18 | 609.27 |  111.54 | 473.32 | 380.00 | 617.15 |   26.19 |    0.83 |    0.48 |    1.29
4     |  157.90 | 225.84 | 374.79 | 465.19 |  148.48 | 212.45 | 381.40 | 461.13 |   −5.97 |   −5.93 |    1.76 |   −0.87
5     | −113.73 | 672.02 | 390.02 | 785.27 | −120.61 | 687.26 | 391.43 | 800.06 |    6.06 |    2.27 |    0.36 |    1.88
6     |  128.21 | 190.00 | 366.23 | 432.05 |  102.81 | 185.02 | 370.68 | 426.85 |  −19.82 |   −2.62 |    1.21 |   −1.20
7     |  184.55 | 592.80 | 384.91 | 730.50 |  197.23 | 600.11 | 389.99 | 742.38 |    6.88 |    1.23 |    1.32 |    1.63
8     |  −80.07 | 460.32 | 374.07 | 598.53 |  −60.11 | 462.76 | 378.33 | 600.75 |  −24.93 |    0.53 |    1.14 |    0.37
9     | −113.76 | 128.64 | 352.65 | 392.24 |  −89.32 | 120.79 | 354.29 | 384.83 |  −21.48 |   −6.10 |    0.47 |   −1.89

