Summary
In the field of computer vision, any two images of the same planar surface in space are related by a homography (assuming a pinhole camera model). This has many practical applications, such as , , or camera motion—rotation and translation—between two images. Once camera resectioning has been done from an estimated homography matrix, this information may be used for navigation, or to insert models of 3D objects into an image or video, so that they are rendered with the correct perspective and appear to have been part of the original scene (see Augmented reality). We have two cameras a and b, looking at points in a plane. Passing from the projection of in b to the projection of in a: where and are the z coordinates of P in each camera frame and where the homography matrix is given by is the rotation matrix by which b is rotated in relation to a; t is the translation vector from a to b; n and d are the normal vector of the plane and the distance from origin to the plane respectively. Ka and Kb are the cameras' intrinsic parameter matrices. The figure shows camera b looking at the plane at distance d. Note: From above figure, assuming as plane model, is the projection of vector along , and equal to . So . And we have where . This formula is only valid if camera b has no rotation and no translation. In the general case where and are the respective rotations and translations of camera a and b, and the homography matrix becomes where d is the distance of the camera b to the plane. When the image region in which the homography is computed is small or the image has been acquired with a large focal length, an affine homography is a more appropriate model of image displacements. An affine homography is a special type of a general homography whose last row is fixed to homest is a GPL C/C++ library for robust, non-linear (based on the Levenberg–Marquardt algorithm) homography estimation from matched point pairs (Manolis Lourakis). OpenCV is a complete (open and free) computer vision software library that has many routines related to homography estimation (cvFindHomography) and re-projection (cvPerspectiveTransform).
About this result
This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.
Related lectures (8)
Robust Vision: State of the Art
Explores the challenges of robust vision, including distribution shifts, failure examples, and strategies for improving model robustness through diverse data pretraining.
Faster Gaze Prediction: Dense Networks and Fisher Pruning
Explores the development of faster architectures for saliency prediction using deep networks and pruning methods.
Show more
Related publications (18)
Related concepts (1)
Camera resectioning
Camera resectioning is the process of estimating the parameters of a pinhole camera model approximating the camera that produced a given photograph or video; it determines which incoming light ray is associated with each pixel on the resulting image. Basically, the process determines the pose of the pinhole camera. Usually, the camera parameters are represented in a 3 × 4 projection matrix called the camera matrix. The extrinsic parameters define the camera pose (position and orientation) while the intrinsic parameters specify the camera image format (focal length, pixel size, and image origin).