@PhdThesis{ArminMehri2023,
author="Armin Mehri",
editor="Angel Sappa",
title="Deep learning based architectures for cross-domain image processing",
year="2023",
publisher="IMPRIMA",
abstract="Human vision is restricted to the visual-optical spectrum. Machine vision is not.Cameras sensitive to diverse infrared spectral bands can improve the capacities ofautonomous systems and provide a comprehensive view. Relevant scene contentcan be made visible, particularly in situations when sensors of other modalities,such as a visual-optical camera, require a source of illumination. As a result, increasing the level of automation not only avoids human errors but also reducesmachine-induced errors. Furthermore, multi-spectral sensor systems with infraredimagery as one modality are a rich source of information and can conceivablyincrease the robustness of many autonomous systems. Robotics, automobiles,biometrics, security, surveillance, and the military are some examples of fieldsthat can profit from the use of infrared imagery in their respective applications.Although multimodal spectral sensors have come a long way, there are still severalbottlenecks that prevent us from combining their output information and usingthem as comprehensive images. The primary issue with infrared imaging is the lackof potential benefits due to their cost influence on sensor resolution, which growsexponentially with greater resolution. Due to the more costly sensor technologyrequired for their development, their resolutions are substantially lower than thoseof regular digital cameras.This thesis aims to improve beyond-visible-spectrum machine vision by integrating multi-modal spectral sensors. The emphasis is on transforming the produced images to enhance their resolution to match expected human perception, bring the color representation close to human understanding of natural color, and improve machine vision application performance. This research focuses mainly on two tasks, image Colorization and Image Super resolution for both single- and cross-domain problems. We first start with an extensive review of the state of the art in both tasks, point out the shortcomings of existing approaches, and then present our solutions to address their limitations. Our solutions demonstrate that low-cost channel information (i.e., visible image) can be used to improve expensive channelinformation (i.e., infrared image), resulting in images with higher quality and closer to human perception at a lower cost than a high-cost infrared camera.",
optnote="MSIAU",
optnote="exported from refbase (http://158.109.8.37/show.php?record=3959), last updated on Fri, 26 Jan 2024 10:27:52 +0100",
isbn="978-84-126409-1-5"
}