COMPLEXITY AND SCENE DETECTION FOR VIDEO CODING OPTIMIZATION

The increasing demand for ultra-high-definition video content has significantly elevated the importance of efficient video compression standards. H.266/VVC (Versatile Video Coding) represents the latest advancement in this field, offering substantial improvements in compression performance. However, with enhanced efficiency comes increased encoding complexity, necessitating intelligent strategies to optimize encoding parameters. This thesis explores complexity-driven optimization in H.266/VVC encoding by evaluating two methods of video complexity assessment. The first approach, Scene-Based Complexity Classification, involves detecting scene transitions using PySceneDetect and classifying each scene according to intra and inter prediction complexities based on luminance variance and inter-frame pixel differences. This method allows encoding decisions to be guided by coarse- grained, scene-level complexity insights. To explore a more granular and automated alternative, the Video Complexity Analyzer (VCA) is introduced. VCA provides fine-grained, block-level complexity metrics such as spatial texture energy, temporal gradients, and edge density, extracted from raw YUV video frames. These features offer a detailed, frame-by-frame perspective on content complexity without requiring manual segmentation. A comparative analysis of both methods highlights their respective strengths, the interpretability and scene-awareness of the scene based complexity classification approach versus the precision of VCA. Scene based Complexity Classification stood tall in most of the cases. This Also led to need of more study on different approaches for enhancing the performance of H.266/VVC.

COMPLEXITY AND SCENE DETECTION FOR VIDEO CODING OPTIMIZATION

KILLI, SOMASEKHAR

2024/2025

Abstract

The increasing demand for ultra-high-definition video content has significantly elevated the importance of efficient video compression standards. H.266/VVC (Versatile Video Coding) represents the latest advancement in this field, offering substantial improvements in compression performance. However, with enhanced efficiency comes increased encoding complexity, necessitating intelligent strategies to optimize encoding parameters. This thesis explores complexity-driven optimization in H.266/VVC encoding by evaluating two methods of video complexity assessment. The first approach, Scene-Based Complexity Classification, involves detecting scene transitions using PySceneDetect and classifying each scene according to intra and inter prediction complexities based on luminance variance and inter-frame pixel differences. This method allows encoding decisions to be guided by coarse- grained, scene-level complexity insights. To explore a more granular and automated alternative, the Video Complexity Analyzer (VCA) is introduced. VCA provides fine-grained, block-level complexity metrics such as spatial texture energy, temporal gradients, and edge density, extracted from raw YUV video frames. These features offer a detailed, frame-by-frame perspective on content complexity without requiring manual segmentation. A comparative analysis of both methods highlights their respective strengths, the interpretability and scene-awareness of the scene based complexity classification approach versus the precision of VCA. Scene based Complexity Classification stood tall in most of the cases. This Also led to need of more study on different approaches for enhancing the performance of H.266/VVC.

Scheda

Scheda DC

	Facoltà/Dipartimento
	
				DIPARTIMENTO DI INGEGNERIA INDUSTRIALE E DELL'INFORMAZIONE
			
	Corso di studio
	
				INDUSTRIAL AUTOMATION ENGINEERING - INGEGNERIA DELL'AUTOMAZIONE INDUSTRIALE [06417]
			
	Anno Accademico
	
				2024
			
	Titolo inglese
	
				COMPLEXITY AND SCENE DETETCTION FOR VIDEO CODING OPTIMIZATION
			
	Abstract in italiano
	
				The increasing demand for ultra-high-definition video content has significantly elevated the
importance of efficient video compression standards. H.266/VVC (Versatile Video Coding)
represents the latest advancement in this field, offering substantial improvements in
compression performance. However, with enhanced efficiency comes increased encoding
complexity, necessitating intelligent strategies to optimize encoding parameters.
This thesis explores complexity-driven optimization in H.266/VVC encoding by evaluating
two methods of video complexity assessment. The first approach, Scene-Based Complexity
Classification, involves detecting scene transitions using PySceneDetect and classifying each
scene according to intra and inter prediction complexities based on luminance variance and
inter-frame pixel differences. This method allows encoding decisions to be guided by coarse-
grained, scene-level complexity insights.
To explore a more granular and automated alternative, the Video Complexity Analyzer
(VCA) is introduced. VCA provides fine-grained, block-level complexity metrics such as
spatial texture energy, temporal gradients, and edge density, extracted from raw YUV video
frames. These features offer a detailed, frame-by-frame perspective on content complexity
without requiring manual segmentation.
A comparative analysis of both methods highlights their respective strengths, the
interpretability and scene-awareness of the scene based complexity classification approach
versus the precision of VCA. Scene based Complexity Classification stood tall in most of the
cases. This Also led to need of more study on different approaches for enhancing the
performance of H.266/VVC.
			
	Relatore
	
				FAVALLI, LORENZO
			
	Appare nelle tipologie:
	
				Lauree Magistrali

File in questo prodotto:

File	Dimensione	Formato
Thesis_Report_somasekhar_final.pdf accesso aperto Descrizione: This Thesis investigates complexity driven optimization techniques for H.266/VVC video codec. It evaluates two distinct video complexity analysis methods: a scene based classification approach and Video Complexity Analyzer(VCA). Dimensione 7.36 MB Formato Adobe PDF Visualizza/Apri	7.36 MB	Adobe PDF	Visualizza/Apri

È consentito all'utente scaricare e condividere i documenti disponibili a testo pieno in UNITESI UNIPV nel rispetto della licenza Creative Commons del tipo CC BY NC ND.
Per maggiori informazioni e per verifiche sull'eventuale disponibilità del file scrivere a: unitesi@unipv.it.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14239/33535