Feb 15, Technology/Computer Sciences
In his PhD thesis, Leonardo de Maeztu-Reinares is proposing 3D reconstruction algorithms; they are on a par with the results of the best available techniques and can be executed more rapidly on a computer. His work is based on stereoscopic vision, a technique for obtaining three-dimensional images and which, in order to get accurate results, calls for a computationally heavy load and considerable algorithmic complexity. His work has been published in international journals like the IEEE Transactions on Pattern Analysis and Machine Intelligence.
"The basic principle behind stereoscopy is the use of two or more cameras that can simultaneously pick up the same scene from different positions, similar to what human eyes do," explains this researcher. That way, two or more images are captured for each instant in time and are compared with each other to work out how far the objects are from the cameras, and that way capture the depth lacking in a classical photograph, which is two-dimensional. "This availability of two-dimensional perspectives enables the three-dimensional reconstruction of the scene to be made by using algorithms that seek matching points between images.
Thanks to the continual increase in the calculating capacity of computers, their use is becoming increasingly routine in tasks of surveillance, measuring or inspection, which until recently were the reserve of human beings. With just a camera or video camera, today's PCs can be used for surveillance work on large crowds, inspection of parts at the end of the production line, or the recording of the licence plates of vehicles passing through a particular zone. The problem is that using just one camera limits the capacity of these systems, as they obtain a two-dimensional representation from a three-dimensional world. To overcome this, there are three-dimensional sensors; one of the most widely used owing to its similarity to the human visual system includes two cameras that record the same scene from two slightly separated viewing points.
"In the 1970s and 1980s," says Leonardo de Maeztu, "many scientists thought about imitating the way in which our brains obtain three-dimensional reconstructions using images supplied by each eye. However, the differences between our brains and computers soon became apparent. This is why solutions adapted to the way computers work became gradually more important."
This researcher's PhD thesis has concentrated on developing 3D reconstruction algorithms that can be rapidly executed by a computer. Together with the proposal for new solutions that are more straightforward than existing ones, he has worked intensively with computers that have recently come onto the market, which, thanks to the fact that they include new types of processors, enable complex tasks to be executed fast. "One of the algorithms proposed," he explains, "yields better results than other previous ones in the same class and, what is more, offers a very interesting competitive advantage: it can be implemented in real time using a standard graphics card. Although they are algorithms that require a great calculating capacity, if the full potential of current graphics processors is used, they can even be executed in real time, in other words, to process as many images per second as those caught by the corresponding camera."
As this is a new line of research, he did some intensive documentation work during the early years of this thesis. He also had a four-month stay in Bologna (Italy) supervised by Prof Stefano Mattoccia, a leading researcher in the field of stereo vision,and with whom he continued to collaborate until completing his thesis.
De Maeztuwas awarded top honours with an international distinction for his PhD thesis entitled "Towards accurate and real-time local stereo correspondence algorithms: computational efficiency and massively parallel architectures." As a result of it, eight articles and papers have been published in international journals and conference proceedings. A publication in the most important journal in the field, IEEE Transactions on Pattern Analysis and Machine Intelligence, should be highlighted and so should a paper in one of the two most important conferences in the field, the IEEE International Conference on Computer Vision.
More information: IEEE Transactions on Pattern Analysis and Machine Intelligence 34(2): 410-416.
Provided by Elhuyar Fundazioa
[Home] [Full version] [RSS feed] [Forum]