Advancing Compression-Driven Approaches in 2D and 3D Vision

Guardado en:
Detalles Bibliográficos
Publicado en:PQDT - Global (2025)
Autor principal: Cheng, Ka Leong
Publicado:
ProQuest Dissertations & Theses
Materias:
Acceso en línea:Citation/Abstract
Full Text - PDF
Full text outside of ProQuest
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!

MARC

LEADER 00000nab a2200000uu 4500
001 3232655002
003 UK-CbPIL
020 |a 9798288810046 
035 |a 3232655002 
045 2 |b d20250101  |b d20251231 
084 |a 189128  |2 nlm 
100 1 |a Cheng, Ka Leong 
245 1 |a Advancing Compression-Driven Approaches in 2D and 3D Vision 
260 |b ProQuest Dissertations & Theses  |c 2025 
513 |a Dissertation/Thesis 
520 3 |a The exponential growth of visual data, including 2D images and 3D modalities such as videos and multi-view images, has placed a premium on efficient representation and processing techniques that extend beyond traditional data compression. This thesis explores compression-driven approaches as innovative solutions for 2D and 3D vision, leveraging compression not merely for data reduction but as a transformative tool to enhance efficiency, versatility, and quality across various vision tasks. By leveraging the principles of compression, this thesis addresses a range of challenges, including reversible image transformations, learned lossy compression, joint task optimization with compression, as well as efficient 3D editing with compact yet expressive representations. To this end, we first develop frameworks that utilize invertible neural networks to encode and recover visual information with high fidelity, enabling tasks such as reversible image conversion and lossy image compression. By exploring the inherent invertibility of neural networks, we demonstrate that compression can serve as a reversible conduit for hiding and reconstructing multiple images within a single embedding, as well as improving the quality and efficiency of learned image compression codecs. We then extend image compression methods beyond data reduction by investigating their synergy with other tasks. Specifically, we propose a joint learning framework for image compression and denoising, leveraging the innate denoising capabilities of compression models. This approach achieves superior results in both domains while revealing the latent connections between data reduction and noise suppression. Finally, we expand the scope of compression to 3D vision, introducing a novel editing-friendly framework that encapsulates the appearance of 3D scenes into compact 2D canonical images. By treating the canonical image as a compressive representation of the 3D scene, this approach enables efficient 3D editing through standard 2D tools, eliminating the need for costly re-optimization while maintaining fidelity to the original scene. 
653 |a Quality standards 
653 |a Deep learning 
653 |a Wavelet transforms 
653 |a Video recordings 
653 |a Bandwidths 
653 |a Signal processing 
653 |a Neural networks 
653 |a Flexibility 
653 |a Decomposition 
653 |a Visualization 
653 |a Geometry 
653 |a Entropy 
653 |a Data compression 
653 |a Computer science 
773 0 |t PQDT - Global  |g (2025) 
786 0 |d ProQuest  |t ProQuest Dissertations & Theses Global 
856 4 1 |3 Citation/Abstract  |u https://www.proquest.com/docview/3232655002/abstract/embedded/7BTGNMKEMPT1V9Z2?source=fedsrch 
856 4 0 |3 Full Text - PDF  |u https://www.proquest.com/docview/3232655002/fulltextPDF/embedded/7BTGNMKEMPT1V9Z2?source=fedsrch 
856 4 0 |3 Full text outside of ProQuest  |u https://lbezone.hkust.edu.hk/bib/991013426672603412