The increasing demand for ultra-high-definition video content has significantly elevated the importance of efficient video compression standards. H.266/VVC (Versatile Video Coding) represents the latest advancement in this field, offering substantial improvements in compression performance. However, with enhanced efficiency comes increased encoding complexity, necessitating intelligent strategies to optimize encoding parameters. This thesis explores complexity-driven optimization in H.266/VVC encoding by evaluating two methods of video complexity assessment. The first approach, Scene-Based Complexity Classification, involves detecting scene transitions using PySceneDetect and classifying each scene according to intra and inter prediction complexities based on luminance variance and inter-frame pixel differences. This method allows encoding decisions to be guided by coarse- grained, scene-level complexity insights. To explore a more granular and automated alternative, the Video Complexity Analyzer (VCA) is introduced. VCA provides fine-grained, block-level complexity metrics such as spatial texture energy, temporal gradients, and edge density, extracted from raw YUV video frames. These features offer a detailed, frame-by-frame perspective on content complexity without requiring manual segmentation. A comparative analysis of both methods highlights their respective strengths, the interpretability and scene-awareness of the scene based complexity classification approach versus the precision of VCA. Scene based Complexity Classification stood tall in most of the cases. This Also led to need of more study on different approaches for enhancing the performance of H.266/VVC.

The increasing demand for ultra-high-definition video content has significantly elevated the importance of efficient video compression standards. H.266/VVC (Versatile Video Coding) represents the latest advancement in this field, offering substantial improvements in compression performance. However, with enhanced efficiency comes increased encoding complexity, necessitating intelligent strategies to optimize encoding parameters. This thesis explores complexity-driven optimization in H.266/VVC encoding by evaluating two methods of video complexity assessment. The first approach, Scene-Based Complexity Classification, involves detecting scene transitions using PySceneDetect and classifying each scene according to intra and inter prediction complexities based on luminance variance and inter-frame pixel differences. This method allows encoding decisions to be guided by coarse- grained, scene-level complexity insights. To explore a more granular and automated alternative, the Video Complexity Analyzer (VCA) is introduced. VCA provides fine-grained, block-level complexity metrics such as spatial texture energy, temporal gradients, and edge density, extracted from raw YUV video frames. These features offer a detailed, frame-by-frame perspective on content complexity without requiring manual segmentation. A comparative analysis of both methods highlights their respective strengths, the interpretability and scene-awareness of the scene based complexity classification approach versus the precision of VCA. Scene based Complexity Classification stood tall in most of the cases. This Also led to need of more study on different approaches for enhancing the performance of H.266/VVC.

COMPLEXITY AND SCENE DETECTION FOR VIDEO CODING OPTIMIZATION

KILLI, SOMASEKHAR
2024/2025

Abstract

The increasing demand for ultra-high-definition video content has significantly elevated the importance of efficient video compression standards. H.266/VVC (Versatile Video Coding) represents the latest advancement in this field, offering substantial improvements in compression performance. However, with enhanced efficiency comes increased encoding complexity, necessitating intelligent strategies to optimize encoding parameters. This thesis explores complexity-driven optimization in H.266/VVC encoding by evaluating two methods of video complexity assessment. The first approach, Scene-Based Complexity Classification, involves detecting scene transitions using PySceneDetect and classifying each scene according to intra and inter prediction complexities based on luminance variance and inter-frame pixel differences. This method allows encoding decisions to be guided by coarse- grained, scene-level complexity insights. To explore a more granular and automated alternative, the Video Complexity Analyzer (VCA) is introduced. VCA provides fine-grained, block-level complexity metrics such as spatial texture energy, temporal gradients, and edge density, extracted from raw YUV video frames. These features offer a detailed, frame-by-frame perspective on content complexity without requiring manual segmentation. A comparative analysis of both methods highlights their respective strengths, the interpretability and scene-awareness of the scene based complexity classification approach versus the precision of VCA. Scene based Complexity Classification stood tall in most of the cases. This Also led to need of more study on different approaches for enhancing the performance of H.266/VVC.
2024
COMPLEXITY AND SCENE DETETCTION FOR VIDEO CODING OPTIMIZATION
The increasing demand for ultra-high-definition video content has significantly elevated the importance of efficient video compression standards. H.266/VVC (Versatile Video Coding) represents the latest advancement in this field, offering substantial improvements in compression performance. However, with enhanced efficiency comes increased encoding complexity, necessitating intelligent strategies to optimize encoding parameters. This thesis explores complexity-driven optimization in H.266/VVC encoding by evaluating two methods of video complexity assessment. The first approach, Scene-Based Complexity Classification, involves detecting scene transitions using PySceneDetect and classifying each scene according to intra and inter prediction complexities based on luminance variance and inter-frame pixel differences. This method allows encoding decisions to be guided by coarse- grained, scene-level complexity insights. To explore a more granular and automated alternative, the Video Complexity Analyzer (VCA) is introduced. VCA provides fine-grained, block-level complexity metrics such as spatial texture energy, temporal gradients, and edge density, extracted from raw YUV video frames. These features offer a detailed, frame-by-frame perspective on content complexity without requiring manual segmentation. A comparative analysis of both methods highlights their respective strengths, the interpretability and scene-awareness of the scene based complexity classification approach versus the precision of VCA. Scene based Complexity Classification stood tall in most of the cases. This Also led to need of more study on different approaches for enhancing the performance of H.266/VVC.
File in questo prodotto:
File Dimensione Formato  
Thesis_Report_somasekhar_final.pdf

accesso aperto

Descrizione: This Thesis investigates complexity driven optimization techniques for H.266/VVC video codec. It evaluates two distinct video complexity analysis methods: a scene based classification approach and Video Complexity Analyzer(VCA).
Dimensione 7.36 MB
Formato Adobe PDF
7.36 MB Adobe PDF Visualizza/Apri

È consentito all'utente scaricare e condividere i documenti disponibili a testo pieno in UNITESI UNIPV nel rispetto della licenza Creative Commons del tipo CC BY NC ND.
Per maggiori informazioni e per verifiche sull'eventuale disponibilità del file scrivere a: unitesi@unipv.it.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14239/33535