This thesis investigates the design and implementation of a real-time multi-camera AI-based person activity recognition system on a low-power embedded GPU platform. The work focuses on the feasibility of deploying a complete perception pipeline, including human detection, action classification, visualization, and streaming, under the computational and energy constraints of embedded hardware. Starting from a baseline GPU implementation, the system is progressively redesigned through batching strategies, CUDA kernel adaptations, multi-thread and multi-stream execution, cross-architecture deployment, and embedded-specific optimizations. The final implementation is deployed on the NVIDIA Jetson Xavier NX and validated in a three-camera configuration. The results show that real-time operation is achievable on the selected embedded platform, while also highlighting the practical limits imposed by resource contention, power modes, and model behaviour. More broadly, the thesis shows that embedded AI deployment is not only a model inference problem, but a system-level engineering challenge involving data movement, concurrency, scheduling, and hardware-aware design.
Advanced vehicle surveillance: a real-time multi-camera AI-based person activity recognition on a low-power embedded GPU
FIOCCHI, RICCARDO
2024/2025
Abstract
This thesis investigates the design and implementation of a real-time multi-camera AI-based person activity recognition system on a low-power embedded GPU platform. The work focuses on the feasibility of deploying a complete perception pipeline, including human detection, action classification, visualization, and streaming, under the computational and energy constraints of embedded hardware. Starting from a baseline GPU implementation, the system is progressively redesigned through batching strategies, CUDA kernel adaptations, multi-thread and multi-stream execution, cross-architecture deployment, and embedded-specific optimizations. The final implementation is deployed on the NVIDIA Jetson Xavier NX and validated in a three-camera configuration. The results show that real-time operation is achievable on the selected embedded platform, while also highlighting the practical limits imposed by resource contention, power modes, and model behaviour. More broadly, the thesis shows that embedded AI deployment is not only a model inference problem, but a system-level engineering challenge involving data movement, concurrency, scheduling, and hardware-aware design.| File | Dimensione | Formato | |
|---|---|---|---|
|
MASTER THESIS_riccardo_fiocchi.pdf
accesso aperto
Descrizione: This thesis presents a real-time multi-camera person detection and activity recognition system on a low-power embedded GPU, showing how architectural redesign and CUDA optimizations enable stable execution on NVIDIA Jetson Xavier NX.
Dimensione
2.51 MB
Formato
Adobe PDF
|
2.51 MB | Adobe PDF | Visualizza/Apri |
È consentito all'utente scaricare e condividere i documenti disponibili a testo pieno in UNITESI UNIPV nel rispetto della licenza Creative Commons del tipo CC BY NC ND.
Per maggiori informazioni e per verifiche sull'eventuale disponibilità del file scrivere a: unitesi@unipv.it.
https://hdl.handle.net/20.500.14239/34976